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Preface 


Though mathematicians in their publications like to present their subject 
as if it were a deductive science, they will readily admit that back of their 
postulates and theorems are methods of discovery which are largely induc- 
tive. Postulates are not arbitrarily made but are constructed so that 
certain desirable consequences will follow. Proofs validate conjectures 
already made on the basis of experience and experiment, In fact, dis- 
covery of correct results often follows the sequence: A guess on the basis 
of experiment, an attempt at proof which fails but suggests a modified 
guess, which leads to another attempt at proof, and so on. One of the 
charms of mathematics is the pervading spirit of discovery. It was with 
these things in mind that this book was written. 

Throughout, a definite effort is made to encourage and equip the 
student to make discoveries for himself. For the most part, reasons 
for postulates are developed before they are formulated. The statement 
and proof of some of the more important theorems are left to the student 
after preparation in similar situations. Problems are presented and a 
theory developed to answer the question raised, beginning especially 
in the second chapter. 

First, this book makes no attempt to take the place of the teacher. 
It is, rather, his instrument. For instance, at the beginning, postulates 
and definitions are stated for the natural numbers. It is to be expected 
that the class will begin rather by formulating its own postulates and 
definitions and, on reference to the text, will find that its own are essen- 
tially equivalent to those printed. The limitations of a book for such 
a method are apparent. The text is not supposed to be “read like a 
novel.” One recommended method is for the student to conceal the proof 
of a theorem and try to give one himself; failing that, he can reveal the 

vit 
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first two lines and try to proceed from there. This is a slow, but excellent, 
way of laying a firm foundation. An attempt is made to give material 
in some detail at first but less so later on in the hope that by that time the 
student will have acquired greater facility. 

In any case, the pace should be very slow throughout the book. The 
author found, for instance, that the first chapter required about twenty 
class periods. Most of the book has been tried out in a classroom at one 
time or another. 

In writing the first three chapters of the book the author partly had 
in mind those who are planning to teach in secondary and elementary 
schools. They need discussion of the number system which, after all, 
forms the basis for algebra as well as the manipulations of numbers them- 
selves. Number theory contains countless interesting facts, which can 


be used to enliven and clarify elementary teaching of mathematics. 


For instance, repeating decimals can well be used when the elementary 
student first comes face to face with division by a decimal fraction. 
Equations with integer solutions arise from many puzzle problems as well 
as from the desire of a teacher to make her problems ‘“‘come out right.” 
In short, the first three chapters especially are rich in material useful 
for the secondary and elementary school teacher. 

This book also gives an adequate introduction to the theory of numbers 
and its methods for those students majoring in mathematics or beginning 
graduate work. When used for such students much of the first chapter 
may be omitted. If taken slowly, it should be of assistance in bridging 
the gap between problem-solving mathematics and mathematics which 
develops a theory. 

Careful reading of the manuscript by Dr. Carroll V. Newsom and Dr. 
Aubrey J. Kempner has resulted in many improvements which the author 
acknowledges with deep gratitude. 

The theory of numbers seems to have a fascination for the layman 
and mathematician alike. It is hoped that this book may add to the 
multitude of its disciples. 


Burton W. JONES 
The University of Colorada 


January, 1956 
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The Development of the 
Number System 


1.1. Introduction 


The theory of numbers concerns itself with the properties of integers or 
whole numbers; and, because it is concerned with integers, it has on the 
one hand the intrinsic interest that is associated with the solution of 
puzzles and on the other hand the inherent difficulty imposed by the 
restriction to integers. This explains the fact not only that amateurs 
play with the theory of numbers more than with any other branch of 
mathematics but also that most mathematicians have at various times 
worked in this field. 

Some results in number theory are easy to state and easy to prove. 
For example, it is true that if any integer is subtracted from its cube, the 
result is divisible by 6; that is, “ἢ — x is divisible by 6 if z is an integer. 
This is true, since 

e— a= (x— l)a(x+1). 
This equation shows that z* — x is the product of three consecutive in- 
tegers. At least one of these integers must be even and at least one 
divisible by 3, which shows that the product is divisible by 6. 

Some results are easy to state but hard to prove. For example, recall 
that a prime number is an integer greater than 1 which is divisible only by 
itself and 1, the first ten prime numbers being 

2,3, 5, 7, 11, 13, 17, 19, 28, 29. 
(An integer greater than 1 and not a prime is called composite.) Bertrand’s 
1 
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postulate states that between any number greater than 1 and its double 


there is at least one prime number. For instance, between 3 and 6 is the | 
prime 5, between 4/2 and 24/2 is the prime 2. This general fact has been — 


proved, but the proof is rather long and difficult. 

Some rather apparent conjectures are easy to state but have not been 
proved; in fact, what amounts to the same thing to a mathematician, we 
do not know whether or not they are true. For instance, two primes are 
called twin primes if they differ by 2. Thus 3 and 5, 11 and 13, and 17 and 
19 are twin primes. Is there a largest couple of twin primes? No one 
knows, though there is good evidence to support the belief that there is 
none. 

Some propositions are hard to understand and hard to prove, such 
as the following: Two quadratic forms are congruent in the rational field 
if they are congruent in the field of reals and in all p-adic fields. There 
is no point in trying to explain this here. 

Instead of presenting a theory to the student in this book, every 
effort is made to encourage him to approach the subject from certain 


leading problems in the hope that he can to a large extent under such | 


guidance develop much of the theory himself. Though on many occasions 
the solutions to exercises in one section may be found in the next, it is to 
be hoped that the student will not attempt to read in advance but will 
try to solve the exercises, using only what has gone before. In that way 
he will not only make the subject more interesting because it is his own 
but will find his grounding more sure. 


Moreover, the emphasis here is not on solving numerical exercises © 


but in exploring the properties of numbers. These results can be stated 
most clearly in the form of theorems and corollaries. The function of the 
exercises is to illustrate the theory, to find new results, or to lead to con- 
jectures on the basis of numerical evidence which may later be estab- 
lished by proof. The mathematician’s exploration often takes the form 
of guessing results from one kind of evidence or another and then validat- 
ing the guesses by proof. In this chapter we shall explore the funda- 
mental properties of numbers beginning with the integers. 


1.2. Fundamental properties of natural numbers 


The natural numbers sometimes called positive integers are what we 
use to count with. For instance, the number which we call three is the 
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most important thing common to three hats, three houses, and three 
lives. The system of natural numbers: 1,2,3,4,5,6, --- is a universal 
tally device by which we can tell whether there are more things in one 
category than in another. It is an abstraction independent of the lan- 
guage we use to describe it. There is no great point in trying to define 
natural numbers beyond this; we may regard it as an undefined concept 
which we assume everyone understands. 

Thus with each set of objects we can associate a natural number 
which describes how many objects are in the set. The natural numbers 
associated in this way with two sets A and B will be said to be the same 
if we can set up a one-to-one correspondence between the objects of A and 
those of B, that is, so that each object of A has one and only one “mate” 
in B and there are no “bachelors” nor “spinsters.” For instance, a good 
way to discover if there are just as many boys as girls in a party is to 
form a procession with each boy taking a girl. If there are no boys or 
girls left without partners, we know that the number of boys is equal to 
the number of girls. Furthermore, no matter how partners were chosen, 
this result would be the same. In general, then, if such a correspondence 
can be established, the natural number associated with one set will be 
the same as (that is, equal to) that associated with the other. 

Equality of natural numbers has four fundamental properties: 


1. For any natural number 4, it is true that a = a (reflexive). 

2. If a = ὃ, then ὃ = a (symmetric). 

3. If a= b and ὃ = ¢, then a = c (transitive). 

4. For any a and ὃ, either a = ὃ or a γέ ὃ (determinative). In each 
case a,b,c are assumed to be natural numbers. 


Addition of two numbers can be defined as follows: If the number of 
things in one set is c and in another set is d, then we call the number of 
things in both sets together the sum of c and d, and write itc+d. Asa 
result of this definition and those above, addition has four properties: 


la. The sum of two natural numbers is a natural number (closure). 
2a. If a = ὃ, thena+c=b-+c (addition is well defined). 

3a. a+b = b+ a (addition is commutative). 

4a. (a+b) +c=a+ (ὃ - οὐ (addition is associative). 


The commutative property holds, since the order in which we count the 


| 
. 
| 
| 
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two sets makes no difference in the sum, and the closure property la 
follows from our definition of natural number in terms of counting. For 
the associative property as shown above, if we denote by A,B,C, sets 
associated with the numbers a,b,c, respectively, we on the one hand 
combine C' with the combination of A and B, while on the other hand we 
combine A with the combination of B and C. Again our definition shows 
that the results are the same. 

Multiplication of two natural numbers can be defined as follows: 
If there are ὃ sets of things with c things in each set, we define “‘b times 
c” or “the product of ὃ and c’”’ to be the total number of things in the b 
sets. The product of ὃ and c is written ὃ X ς, b-c or merely be. Schemat- 
ically, if we arrange the b sets in rows we have 


ce things 
b sets 


Then ‘‘b times c” is understood to be that number obtained by counting 
by rows the total number of dots in the rectangular array. In the first 
place, the result must be a natural number (the closure property). Sec- 
ondly, a = ὃ implies ac = be, since then a rows of c dots will contain just 
as many dots as ὃ rows of ὁ dots. Thirdly, we could count the dots by 
columns which would be ‘“‘c times b” which shows that the commutative 
property for multiplication holds. 


In order to deal with the associative property of multiplication, con- 


sider a stack of trays of glasses in a cafeteria. Suppose there are r trays 
each having s rows of glasses with ¢ glasses in each row. Then by our 
definition, the number of glasses on each tray would be st. Thus we have 
r trays of st glasses, that is, r sets with st in each set, and the total number 
of glasses would be r(st). On the other hand, if we look at the stack from 
the front we will see r rows of glasses with s in each row, that is rs glasses; 
this will be repeated ¢ times as we go back from the front. Thus we will 
have (rs)t different glasses. Thus we have r(st) = (rs)t, the associative 
property for multiplication. 

Finally, there is a property involving both processes, the distributive 
property: α(ὖ -᾿ 6) - αὖ - αο. We leave the proof of this in terms of 
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counting as an exercise. Note that the commutative property of multi- 
plication implies also that ab + ac = (ὁ + c)a = ba + ca. 

To summarize, we have given for natural numbers 
Definitions: natural number, equality, addition, multiplication. 


PROPERTIES ADDITION MULTIPLICATION 


Well defined a=bimpliesa+c=b+c a=b implies ac = be 


Closure sum of two natural product of two natural 

numbers a natural number numbers a natural number 
Commutative a+b=b+a ab = ba 
Associative (a+b) +c=a+(b+c) (ab)c = a(bc) 
Distributive ab + ac = a(b +c) 


ba + ca = (b+ c)a 


Throughout the above table the letters stand for natural numbers. 
We shall find that the properties are shared by other kinds of numbers if 
equality, addition, and multiplication are properly defined. Notice that 
multiplication and addition are not interchangeable in the distributive 
property; thatis,a + (be) (a+ b)(a+c). 


Exercises 1.2 


1. Which of the above properties are used in adding a column of figures? 
Explain. 
2. Prove the distributive property from the definitions given. 
3. What are some of the undefined terms in the section above? 
4. Which of the above properties make the following expressions unam- 
biguous: 

a+b6+c, abe? 


5. Using only the above properties, show that (a+6)+(c+d) = 

(a+[b+c])+d. Point out at each stage exactly what property is used. 
at expression does this make unambiguous? 

6. What common axiom in plane geometry is expressed by the third 

Property of equality listed above? 

7. Which of the following have the four properties listed above for 


' 
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equality of natural numbers: (a) is the son of; (b) is the brother of ; (c) is 
in the same family as; (d) is congruent to. Explain. For example, in case 


(a) the reflexive property would hold if “A is the son of B” implies “B is 
the son of A.” 


1.3. Inequalities 


Suppose we have two sets of objects, namely, A, containing a objects, 


and B, containing ὃ objects. We say that a is greater than ὃ (written 


a > b) if we have to add some objects to the set B to have just as many 
things as there are in A. More precisely, we give the definition: a > ἢ 
means: ὃ + x = a is solvable for some natural number x. The symbol is 
read: a is greater than b. 

Another way to express the relationship, a greater than ὃ, is to say: 
ὃ is less than a, that is,b <a. Both > and < are symbols of inequality 
and have the following properties expressed in terms of greater than for 
the natural numbers a, b, and c: 


1. One of the following and only one must hold: 
a>ba=b,a <b. 

2. Ifa >band ὃ >c, then a > c (the transitive property). 

3. Ifa >b, thena+c >b+ cand ac > be. 


To prove the first property suppose, for instance, that both a = b and 


a >b. Then, from the definition of inequality, there is a natural number 
c, such thata = b+ c, But a = ὁ, by hypothesis, and hence by the tran- 
sitive property of equality,b = b+ c. But we cannot set up a one-to-one 
correspondence between ὃ +c objects and ὃ objects. Thus we cannot 
have both a= ὃ anda » ὃ. The rest of the first property can similarly 
be established. 

Before proceeding with the proof of the second property it may be 
well to consider the type of argument used in the previous paragraph — 
a form of argument which goes by the name reduction to absurdity or its 
Latin equivalent reductio ad absurdum. This has its basis in the axiom of 
logic that if P is true and P implies Q then Q must be true. It follows that 
if P implies something which is false then P must itself be false. There 
are many examples of such arguments outside of the field of mathematics, 
One example would be the following: “1 know John has not left the house 
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since the snowfall, since if he had there would have been tracks in the 
snow and there are none.” In order to establish the desired result, it was 
supposed false and shown that such a supposition would have false con- 
sequences. 

The second property may be proved as follows: a > ὃ and ὃ >c 
imply the existence of natural numbers x and y such that a = ὃ + x and 
b=c+y. Then, by use of the well-defined property of addition, we may 
replace ὃ in the first equation by c+ y and have a= (ὁ τ ψ) τα πον 
(y + x) by the associative property. However, y +z is a natural number 
by the closure property, and it may be designated by z. This shows that 
a =c+zand hence a >e. 

The third property we leave as an exercise. 

We shall also use on occasion the symbols a 2 c or a > c to mean “a 
is greater than or equal to c,” that is, ‘‘a is not less than c.” We also have 
the corresponding symbol with the inequality reversed. 


1.4. Cancellation properties 


The well-defined property of addition showed us that if ὃ = c then 
a+b=a+e. The converse of this statement is what we call the can- 
cellation property; that is, 

5a. Ifia+b=a+e, then ὃ - 6. 

This may be proved as follows: If ὃ is not equal to c, then, by the first 
property of inequalities, ὃ must either be greater than c or less thane. We 
may without loss of generality assume that ὃ is greater than c, that is, 
for some natural number d, b=c+d. Then a+b=a+c becomes 
a+c+d=a+ce, which, as we have seen, is impossible. This estab- 
lishes the cancellation property of addition. 

Similarly, for multiplication the cancellation property is 

5m. If ab = ac, then ὃ = 6. 

The proof is left as an exercise. 


1.5. Critique 


Some readers may be made uneasy by the above formulations, for 
there are many undefined terms and concealed assumptions (for instance, 
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we considered glasses on trays). Though in the most rigorous treatment it 
is not possible to eliminate all such things, there is another way of defining 
the positive integers due to Peano, an Italian mathematician of the nine- 
teenth century, which is much more precise in that the assumptions are 
reduced to a minimum. His axioms may be formulated as follows: 


1. There is a natural number 1. 

2. Every natural number ὃ has a unique successor, which is also a 
natural number and which we call b*; also b is called the predeces- 
sor of bt. 

The integer 1 has no predecessor. 

If at = bt, then a = ὃ. 

. Addition is defined by a + 1 = at and a+ bt = (a+ δ}7. 

. Multiplication is defined by a- 1 = a, a: δὴ = ab +a. 

(The induction postulate) A set of natural numbers A must in- 
clude all the natural numbers if it has the following properties: 
a. A contains the number 1. 

b. The successor of any number of A is also in A. 


ND OR φ9 


From these assumptions all the above properties of natural numbers, 
and many others, may be proved. The uneasy reader might be interested 
to see to what extent he can do this. 

The last axiom above deserves special attention, for it allows us to 
use the method of proof by induction. This can be seen by taking the 
set A to be the set of integers for which a certain theorem is true. Then, 
to show that the theorem is true for all integers, that is, that A contains 
all integers, we must show both of the following: 


1. The theorem is true for the number 1. 
2. If the theorem holds for any number of A, then it holds for the 
successor of that number. 


For example, consider the following important 


Theorem 1.5. If ais a natural number and ὃ is another natural num- 
ber not greater than a, then one of the following holds: 


a = qb, 
a=qb+r,r <b, 


where g and r are natural numbers. 
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We prove this by induction on a. It is certainly true if a = 1, since 
the requirement that ὃ < a implies ὃ = 1 and g= 1. 

Now, suppose the theorem is true for the predecessor of a, that is, 
a-—1. Weshall prove it true for a. By our assumption, one of the fol- 
lowing holds: 

(1) a-li-= qb, 
(2) a—l=qb+r,r <b. 

First, if (1) holds, then a= φῦ - 1. It follows that either b= 1 and 
a= (q+ 1)b, which is the first case of the theorem, or 1 < ὃ, which is the 
second case. 

Second, if (2) holds, we have 


a=gb+r+1. 


Then, if b= r+ 1, we have a = (¢+ 1)b, which is the first case of the 
theorem. If 6 >r+1, the second case of the theorem holds. Thus our 
proof is complete. 

Third, we assumed in the above cases that b << α -- 1. If this is not 
the case and b < a, then b = a, and the theorem trivially holds. 

An immediate consequence of the theorem is 


_ Corollary 1.5. If ὃ is a natural number less than a, then there is a 
unique natural number q such that 


gob < aand (ᾳ - 1)b >a. 


The number g of this corollary is called the greatest integer in a/b and 
is written q = [a/b]. 


Exercises 1.5 


- Complete the proof of the first property of inequalities. 
- Prove the third property of inequalities. 

- Prove the cancellation property of multiplication. 

- Show how the corollary above follows from the theorem. 


ah ὦ we = 


- Show how the postulates of Peano may be used to prove the associative 
Property of addition. H1yr: For property 4a, use induction on 6. 
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6. Show that if a and ὃ are natural numbers, then ab 2 a. 

7. What properties do inequality and equality have in common? 
8. Show by induction or otherwise that any set of natural numbers, 
finite in βιὰ contains a greatest natural number. 


9. Let @1,02,a3, "γᾶς be a sequence of natural numbers such that ‘om 
is less than the one which precedes it. Show that k S ay, in other words, 


that the number of numbers in the sequence cannot be greater than the 


first number. 


1.6. Zero, subtraction 


We have defined a > b to imply the existence of a natural number ¢ 
such thata=6-+c. We then, by definition, say that cis a “minus” ὃ; 
that is, 


a=b+cmeansa—b=c. 


This is appropriate, since, by the cancellation property of addition, there 


is exactly one number ¢ which, when added to b, givesa. (α - ὃ οἱ 


would imply 6+ c’=b-+c, and hence, by the cancellation property, 
c’=c.) Notice that a — ὃ is defined only if a is greater than ὃ. 
It is convenient to define a number, zero, by the equation 


0O+a=-a+0=a. 


Then, if the definition of inequality is to be extended to include zero 
we see that every natural number ὃ is greater than zero because 0 + z = ὃ 


has the solution x = ὁ. Furthermore, as in Section 1.4, we can prove the 


cancellation property of addition for natural numbers and zero; that is, 
zero is the only number which, when added to a number, leaves it unal- 
tered. Hence we may define a — a to mean zero. If zero is to be included 
in the numbers which have the distributive property, we must define 
multiplication by zero in such a way that 


ab = (a+ 0)b = αὖ +0-b. 
Hence we must have 0-b=0. Thus we define 
0-b=b-0=0. 
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We have discussed sets having n objects for any natural number n. 
It is convenient to define a null set as one similarly associated with the 
number zero. Thus the fact “ΝΟ normal cat has two heads,” could be 
stated as follows: The set of normal cats having two heads is a null set. 
Similarly, if there are no cows in pasture A, we could say that the cows 
in pasture A form a null set. It follows that if A contains a objects and 
B is a null set then the set obtained by combining A and B has a+0 
objects, that is, a objects. Also ὃ - 0 would be the number of objects in 
b null sets, which is zero. However, there seems to be no very satisfactory 
interpretation along the same lines of the product 0 - ὃ. The most satis- 
factory way of thinking of zero is merely as a number having the prop- 
erties 

0+a=a+0=a,0-b=6-0=0. 


Of course there is always the possibility that several requirements 
which we place on a new number may lead to a contradiction. However, 
we shall here assume that no such contradiction exists. It can be shown 
that the above definitions of addition of zero and multiplication imply 
all the properties listed for natural numbers in Section 1.2. 

With the use of zero, Theorem 1.5 can be stated in a different way: 


Theorem 1.6. For any “numbers” a and ὃ, there are numbers g and 
r such that 
a=qb+r,r< ὃ. 


Here the word “number” shall include all natural numbers and zero, 
except that we exclude b = 0. 


1.7. Negative integers 


If we confine ourselves to the natural numbers and zero, the expres- 
sion a — ὃ has meaning only if ὃ is equal to a or less than a. This is an 
unnatural restriction which it is convenient to eliminate. We do so by 
the introduction of negative integers, as follows: If b is a natural number 
or zero, we define a number ~b by the equations 


(1) b+-b="b+b=0, 
and we shall say that ~b = ~c if and only ifb=c. Thus 
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(2) "Ὁ = 0, since 0+ 0 = 0. 


These new numbers exclusive of 0 we call negative integers. | 

First, it is necessary to define addition for these new numbers. If the 
associative and commutative properties of addition are to hold, addition 
of two negative numbers will require the following: 


“a+ b+ (a+ ὃ) = (a+7a) + (6+7b) =0. 
On the other hand, 


“(a+ ὃ) + (a+b) =0, 
Thus 
“a+ b+ (a+b) Ξ τ(α - ὃ) + (a+b). 


and, if the cancellation property of addition is to hold, we must define 
(3) “a+b - -(α - ὃ). 


To define the sum of a natural number and a negative integer we need 
to consider two cases. Suppose a > b, then, by the definition of a — b, 
we have (a — ὃ) + ὃ = a, and hence we must define a + ~b so that 


a+b - [(α -- ὃ) - 07 Ἡ τὸ = (α -- ὃ) + (+b) = @—b)+0=a-b. 


Similarly, ~b + α Ξ α -- δ. On the other hand, if a « ὃ we would need to 
define addition so that the following would be true: 


a+ ~b=a+ (ὃ -- α) -Ὁ τ(ὖ -- α) - τὸ Ξ τ -- αὐ ὖ Ἐτῦ τ τῷ -- a). 


Thus we define addition of a natural number and negative integer as 
follows: 


(4) Ifa >b, thena+-b=a-—b=~b+a. 
Ifa <b, thena+~b="b+a=~(b— a). 


Thus we may replace a +~b by a — ὃ on the understanding that when ὃ 
is greater than a, a — b means the negative of b — a. 
We wish the distributive law to hold for the new numbers, and hence 


0 = (6+ ~b)a = ba + (-b)a = ba + ~(ba). 
Thus we define multiplication as follows: 
(5) (“b)a = a(-b) = ~(ab) = ~(ba), 


1.7. Negative integers 13 


where a and ὃ are natural numbers or zero. It remains to define the prod- 
uct of two negative integers. We want to make this definition so that 

0 = (ὃ - “δ)(Γα) = ba) + ((b)(a) = ~(ba) + ((b) (a) = ~(ba) + θα. 
Hence we define 
(6) (~b) (Ta) = ba = (“a)(). 

From these definitions (1) to (6) all the properties listed in Section 
1.2 for the positive integers and zero follow for the negative integers. 

We can now say for all integers that a <b if there is a natural 
number ὁ such that a+c=b. Thus, for instance, —11 < —1 since 
-11 Ἐ 10 Ξ --Ἰ If an integer is greater than zero, we call it positive. 

The natural numbers, zero and the negative integers, together form 
what we call the integers. The set of positive integers coincides with the 
set of natural numbers. 


Exercises 1.7 


1. Show that Theorem 1.6 follows from Theorem 1.5. 
2. Show that the definitions imposed on ὃ and the properties of positive 
integers imply for a, ὃ, 6. 

a. a+b =c+~b implies a= 6. 

b. ("a)b = (“a)c implies ὃ = c. 
3. Justify ~(—b) = b. Is this a definition, or can it be proved? 
4. Is subtraction of integers commutative? Is it associative? Is ad- 
dition of integers commutative? 
5. What properties of inequality listed in Section 1.3 hold for all integers 
including zero? 


6. Do the cancellation properties of addition and multiplication hold 
for integers? 


7. Does α -- ὃ = a—cimply ὃ = c for integers a, ὃ, οἵ Why or why not? 
8. Simplify the following expression, pointing out at each step what 
Properties of numbers you are using: 

c(a + ba) — ac + α -- abe. 


Here a, ὃ, ὁ stand for integers. 
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9. Integers could be defined as pairs of natural numbers in the following 
way: 

a. (a, b) = (c, d) meansa+d=c+t+b. 

b. (a, δ) + (c, d) means (a+ 6, ὃ τ᾿ ἃ). 

6. (a, b)(c, d) means (ac + bd, ad + be). | 
First, basing your proof upon accepted principles for natural numbers, 
show that addition and multiplication are associative and commutative. 
Second, show that 

(z+ a, a) + (y+b, b) = (cx +y +a, a); (a, a) + (ὃ, c) = 0, 0); 
(1 + a, a)(b, c) = (0, ¢). 
Third, show that 
(x + a, a) + (a, x + a) = (a, a). 

All the ‘properties of negative integers may be derived from the 3 
definitions. What number pairs correspond to zero? (Ref. 8, pp. 5 ff.) * 
10. Prove that a « ὃ impliesa—c «ὃ -- 6. 


1.8. Divisibility 
If a and ἢ are two integers, we say that a is divisible by ὃ, or that a 
is a multiple of b, if there is an integer c such that a = bc. Note that we 
introduced negative integers so that the equation a + x = δ would always 
be solvable, but note that without introducing fractions we cannot say 
that ax = b is solvable in all cases. When we deal solely with integers, 
the problem of divisibility is of fundamental importance. If we were 
talking about rational numbers (ratios of integers), there would be no 
problem of divisibility, since every number would be divisible by every 
other number provided the divisor were different from zero. | 

The notation often used for “Ὁ divides a” is bla and “b does no 
divide a” is written b + a. 

It is not hard to prove 

Theorem 1.8a. Ifa|bandalc, then αἱ + cand albe. 


Theorem 1.8b. If αἱ and bic, then ale. 


* The reference numbers refer to the Bibliography in the back of the book, 
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Theorem 1.8c. If alb and bla, where a and ὃ are integers, then 
a= bora=~"b. 

Every positive integer is divisible by itself and 1. Integers greater 
than 1 having no other divisors are called prime numbers or primes. 
Numbers greater than 1 which are not primes are called composite. Thus 
the natural numbers fall into three mutually exclusive categories: 


1. The number 1, 
2. The prime numbers, 
3. The composite numbers. 


Two numbers are called relatively prime (or one is prime to the other) if 
their only common divisors are 1 and —1. For instance, 6 and 35 are 
relatively prime, but 6 and 15 are not. 

The prime numbers 2,3,5,7,11,13, --- are irregular in their distribu- 
tion, and some of the deepest theorems in the theory of numbers, proved 
usually by delicate analysis, have to do with the prime numbers. For 
instance, it has been shown that 


im a(x) (log, x)/x = 1, 


where 7(z) is the number of prime numbers less than z. That is, by taking 
« large enough, w(x)(log, x)/x can be made as close as we please to 1. This 
is called the prime number theorem. Theorem 1.8g later in this section 
states that there is no last prime number. The proof is quite simple. 

The greatest integer which divides two integers is called their great- 
est common divisor, often abbreviated g.c.d. For instance, the g.c.d. of 
6 and —15is3. Using the notation (a,b) to denote the g.c.d. of a and ὃ, 
we would have 


(6, -- 15) = 8. 


Though the above is perhaps the most natural way to define the 
g.c.d., there is another way which is sometimes more useful in proofs. 
Define g, the g.c.d. of two integers a and b, to be that positive integer 
which has the following two properties: 


1. g divides both a and b. 
2. Any number dividing both a and ὃ also divides g. 


That these two definitions yield the same number may be seen as 
follows: Let σι be the g.c.d. according to the first definition and g. accord- 
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ing to the second. The first definition would then imply gs < σι and the 
second that σι would divide go. Hence σὰ < g2, and it must follow that 
Q=9. 

Both definitions can immediately be extended to any number of 
integers. ‘ For instance, we can write 


(6, 21, 35) = 1. 


That any pair of integers has a greatest common divisor can be 
shown from the first definition, for any divisor of a positive integer mus 
be either the number itself or less than it; hence a positive number cannot 
have more than a finite number of divisors. Then, to find the greatest 
common divisor of two numbers, we need merely list their divisors and 
pick out the greatest one common to both. However, this means of find- 
ing the g.c.d. is laborious. There is a somewhat shorter method which de- 
pends on Theorem 1.6 above and the second definition of g.c.d., which is 
as follows: | 


Theorem 1.8d. If a and b are two positive integers, then the process 
of Theorem 1.6 is repeated as follows: 


a=gb+n,0<S17 <b, 
ὃ -»" Qi?i + Τὰ» 0 < T2< Τι, 
γι Ξ Qt2t7s,0 S173 « γ, 
Tr-3 = (οὔ. + Terr, Ὁ S Te-1 < Te-2, 
Te-2 = (αὔρα + Tr, OS Te < Tye-1. 
Eventually one of the 7’s will be zero, call it 7,, and suppose 7;,_; γέ 0. 
Then 7,-; will be the g.c.d. of a and ὃ. The process is called the Euclid 
Algorithm. (An “algorithm” is a term applied to a repetitive process like 
the above, used to achieve a desired result.) 


First, let us see how this process works in a numerical case. 
a = 228 and ὃ = 177. Then we have 


228 = 1-177 + 51, 
177 =3-51 +24, 
51 = 2-24 +3, 
24 -- 8.8 +0. 


The theorem states, thus, that 3 is the g.c.d. of 228 and 177. 
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To prove this, notice first that one of the r’s must finally be zero, 
since, until we reach the point where one is zero, they form a decreasing 
sequence of positive integers. (See Exercise 9 of Section 1.5.) Thus the 
last equality will be 


Tea = Geta + 0, 
and γι divides 7,2. Then Theorem 1.8a and 
Tr-3 = Qe—Fe—2 + Tr—-1 


imply that σι. divides r,s. We can continue to follow this back until 
in the first equation στρ. divides 7; and ὃ and hence a. Therefore we 
have demonstrated that σι. divides a and ὃ, the first requirement of 
g.c.d. 

Since the first equation may be written a — gb = τι, then any com- 
mon divisor, d, of a and ὃ divides γι. Since ἃ divides ὃ and r; it must, 
from the second equation, divide rz, and, continuing, we see that d divides 
ry. Hence the second requirement of the definition is fulfilled, and we 
have proved Theorem 1.8d. We have climbed up the ladder and climbed 
down again. 

There is still a third definition of g.c.d. which we include in the [ol- 
lowing: 


Theorem 1.8e. Given two nonzero integers a and b, let r be the least 
positive integer expressible in the form 


ax + by ΞΞ Υ, 


with z and y integers. Then r is the g.c.d. of a and b. 

Again, using the second definition of g.c.d., we see that the second 
condition is satisfied, for Theorem 1.8a shows that any common divisor 
of a and b divides r. To show that r divides a and ὃ, suppose that r does 
not divide a. Then 


a=oqr+s,0 <8 <7. 


Furthermore, there are integers 2, y such that az + by =r, and 
qr = φαχο + qbyy. However, a = gr +s implies gr = a — s = gax + qbyo. 
Hence s = a(1 — gx) — bay. Thus s is a positive integer less than 7, 
representable in the form az + by, with « = 1 — gm and y= -- σύο. This 
result contradicts our assumption that r is the least integer so expressible 
and completes the proof. 
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We have the following important 


Corollary 1.8e. 
for by the above theorem we know there are integers x and y such that 
mz + ay=1, whence m(bz) + y(ab) = b. However, m divides m(bz) 
and ab, which by Theorem 1.8a shows that m divides b. 


Corollary 1.8e’. If m is prime to a and to ὃ, then it is prime to ab; 
for, if m had a factor in common with ab, it would have a prime factor in 
common, Call this common prime factor p. By the first corollary, p 
would divide b, which would imply that m and b have a common factor 
in contradiction to the hypothesis. 

These then yield the Fundamental Theorem of the theory of numbers: 


Theorem 1.8f (The Fundamental Theorem). If a positive integer is 
factored into prime factors in two ways, as follows: 


Pip2°°° Pr = σι 2" "" Vs; 


where two or more p; or two or more g; may be equal, then r = s and the 
factors p; are equal to the factors g; in some order. Briefly, we then say 
that decomposition into prime factors is unique except for order of the : 


prime factors. 

To prove this, notice that a prime either divides a number or is prime 
to it. Suppose p; is not equal to any of the q,’s. [{ this is true then 
it must be prime to each one and hence by Corollary 1.8e’, to their product. 
This is not possible, since ρὲ divides their product. Hence we may re- 
number the q; if necessary and take pi = φι͵. Then the cancellation the- 
orem for multiplication shows that 


P2ps*** Pr = 9293 """ Qe; 


and we may deal with Ὁ: as we did with p;. Continuing in this way we see 


that the theorem is true. 


This theorem also shows that we can express any integer in the form | 


a= p, ΓᾺς see p.”, 


where now no two p; are equal. This decomposition again is unique ex- 


If m divides ab and (m,a) = 1, then m divides b; 


cept for the order in which the prime factors are written. Suppose the | 


only prime numbers dividing both a and ὃ are 
Pi, Pa, """) Dn» 
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Then we can express a and ὃ in the following forms: 


a, ἃ ..% 9 Ό δ ὃ δ w w 
a= DP,” eee Δα δος Ay **#e r=3b = Pp} DP,’ “εν p*s,'8," eer 8", 


where no two of pi, r;, 8, ate equal. The g.c.d. of a and ὃ is equal to 

pi pe eee p.*, 
where, for each 7, c; is the smaller of a; and b;. How would one express the 
least common multiple (1.c.m.) of a and δῇ 

Any number which is divisible by 2 is called an even number, and all 
others are odd. If two numbers are both even or both odd, they are said 
to be of the same parity; if one is even and the other odd, they are said 
to be of opposite parity. Since every even number is of the form 2k for an 
integer k, and every odd number of the form 2k + 1, it is not hard to see 
that the sum of two numbers of the same parity is even and that the sum 
of two numbers of opposite parity is odd. 

Euclid proved that there is no greatest prime number, that is; there 
is an infinity of prime numbers. We may state it in the following way, 
and we shall give Euclid’s proof. 

Theorem 1.8g. 
primes. 

To prove this, suppose that there are only n primes and list them as 
follows: 


For any positive integer n there are more than ἢ 


P1, P2, ***, Dn 
Then form the number ; 


N = pip2**+ Pat 1. 
If one of the n primes divided N, it would, by Theorem 8.18, have to 
divide 1, which is impossible. Then either N is itself a prime, or it is 
divisible by a prime different from those listed. In either case there 
must be more than » primes. 

The least common multiple of two integers a and b is defined to be 
the smallest positive integer which is divisible by both a and b. This 
Statement is equivalent to the following: m is the least common multiple 
of a and ὁ if it has two properties: 


1, mis a multiple of a and ὃ. 
2. m divides every common multiple of a and b. 


This bears some similarity to our second definition of g.c.d. 
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1. Show that Theorem 1.8e implies the following: If the multiples of a 
are plotted in red on a line and the multiples of ὃ in green, where a and ὃ 
are positive integers with greatest common divisor g, then g will be the 
shortest distance from any green to any red point. 


2. By the process of Theorem 1.8c find the g.c.d. of 576 and 73. Use this 
same process to find integers x and y so that 5762 + 73y = (576, 73). 


3. Prove Theorems 1.8a, 1.8b, and 1.8c. 
4. If (ab, p) = 1, show that p**\ap* + bp* only when k& = 8 and pla + ὃ. 


5. Show that (a,b) =d implies (a/d,b/d)=1. Show that a(b,c) = 
(ab,ac). 


6. Prove that if (a,b) = 1 and r divides ab then r = st where sla and 


‘|b and s and ἐ are uniquely determined except for sign; s or ἐ or both 


may be 1. 


7. Prove that if ab/d is a multiple of a and δ then d divides both a and b. 
Hence show that ab/d is the least common multiple of a and b if and only 
if d = (a,b). 
8. Prove that if a/b and c/d are two fractions in lowest terms, and if 
a,c ad+be 
ee ἄγον "ὦ 
is an integer, then δ᾽ ἃ and d|b; thatis,b = +d. 
9. Prove that the sum of an even number of odd integers is even and the 
product of a set of integers is even unless they are all odd. 


10. Prove that the product of an odd number of integers of the form 
4n — 1 is of the same form. 


11. Using Theorem 1.6 with b = 4 show that every odd integer is of one 
of the forms 4n — 1, 4n - 1, and hence that if the product of an odd 
number of odd integers is of the form 4n + 1 at least one is of the form 
4n+ 1. 


12. Extend the definition of least common multiple to more than two 
integers. 
13. Show that every integer not divisible by 3 is of one of the forms 


{ 
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3n + 1 or 3n + 2, and if an integer has the form 3n + 2 at least one of its 
prime factors is of the form 3n + 2. 
14. Prove that N in Theorem 1.8g has at least one prime factor of the 
form 4n — 1 and hence there is no greatest prime of the form 4n — 1. 
15. Theorem 1.6 shows that every integer s can be expressed in the form 
ka + r, for a given integer a, where 0 < r <a. Which one of the follow- 
ing set of integers is divisible by a: 

8,3 +1,8s+2,°°-,sta-l1? 
Show that there is only one of the set divisible by a. In other words, any 
set of a consecutive integers contains exactly one multiple of a. 
16. Use the result of the previous exercise to show that 2° — 5z* + 4z is 
divisible by 60 for every integer z. 
17. For a given number of steps, ᾿ς, in the Euclid Algorithm show that a 
and b will be the least if 


re = 0, Th. = l, 7κ.--- >= 2, Te-3 = 3, 14> 5, 5 = 8, 7m 


where for each subscript ὁ « k, rie = 7i-1 +7; and q;_1=1. Thusaandb 
would have to be two successive numbers in the sequence, the first ten 
terms of which are 

0, 1, 2, 3, 5, 8, 18, 21, 34, 55. 
That is, if the Euclid Algorithm process has five steps, the least pair of 
values of a and b is 8 and 13. Why does this imply that the Euclid 
Algorithm for (a,b) with ὃ « 34 must contain less than 7 steps? 
18. If a=bq+r and ὁ = bt+s, show that a—c and r—s are either 
both divisible by ὃ or both not divisible and hence that the difference of 
two numbers is divisible by b if and only if their remainders when divided 
by b are equal. 
19. Prove that if f(z) is a polynomial with integer coefficients and if a is 
an integer not zero such that f(a) = 0 then a divides the constant term. 
20. Prove that if p and g are integers and p* = 20" then p and q are both 
even, 
21. Prove that if c? = ab where z,a,b are positive integers and (a,b) = 1 
then there are integers y and z such that 


yz=2,a= y and ὃ = 2’. 
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22. Prove that if a and ὃ have a common prime factor and a?+ δ = οὗ 
then that common prime factor divides c. Would the same result neces- 
sarily hold if the word “prime” were omitted? . 


23. Prove that if (b,c) = 1 then (Ὁ -- οὐ - δ᾽ Ξ or 2. Under what 
conditions will the latter g.c.d. be 1? Answer the similar questions for 
(ὃ + 2c, 2b +c). 
24. Show that the pair of equations, ax + by = r, αἰ + b'y = 1’, where 
all the letters stand for integers and ab’ — αὖ = ἃ τέ 0, have integer 
solutions if and only if rb’ — r’b and r’a — ra’ are divisible by d. | 
25. If a? = οὗ — b, (c,b) = 1, and just one of οἷ is even, use Exercise 
21 to show that there are integers m and n such that mn = a, ὁ — ὃ = πηι, 
σ-Ἐ =n’. 
26. If (c, b) = 1 and both c and ὃ are odd, show that 

a? = οὗ — ὃ 


implies that a is even. Show that (ὁ — b)/2 and (ὁ + b)/2 are relatively 
prime integers and equal respectively to m? and n®, where m and ἢ are 


integers whose product is a/2. Show how this may be used to give all 
solutions of 


a’ + δ = οἷ, 


where a is even, b is odd, and no two of a, b, c, have a common factor 
greater than 1. 


27. The following illustrates for the numbers 35,15,12,6 a device for 
finding the l.c.m. of a set of numbers. Why does it work? 


Non nw Go ὧν 
μι i μι ht Cn 
Ke won μα. μὰ. 
μ᾿ = me DD DD 


Explanation: Since 3 divides three of the numbers, we use it for the first _ 
row. Where it is a divisor of the number in the line above, we place the 
quotient in the row; when it is not a divisor, we repeat the number above. 

Since 5 is a divisor of two numbers of the first row below the horizontal 
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line, we use it next to obtain the second row. So we continue until a row 
consists entirely of ones. The l.c.m. is the product of the numbers in the 
left column, here 3,5,2,7,2. The order in which we divide by these num- 
bers makes no difference to the result. However, at each step it is im- 
portant to begin a row only with a prime divisor. 


28. Prove that for any two integers a and ὃ there are integers ᾳ and r 
such that 


a=bq+r,—b <2r<b. 


This forms the basis for the so-called least remainder algorithm. Carry 
this through for 576 and 73. Show that this process always yields the 
g.c.d. 


29. Establish results similar to those of Exercise 17, but use the least 
remainder algorithm. 


30. Show that “equivalence” in the following sense has the four prop- 
erties of equality listed in Section 1.2: Given a fixed integer m, call “a 
equivalent to b” if a — ὃ is divisible by m, for integers a and ὃ. 


31. If (a,m) = 1, for what integers x will az be “equivalent to” ab in the 
sense of the previous exercise, where ὃ is also an integer? 


1.9. The decimal notation 


The properties of integers in the preceding sections are independent 
of what we call them. We say, “three divides six.” If the Martians 
designated three objects by the word “glob” and six objects by the word 
“toes” while our “divides” is pronounced by them “‘sots” then it would 
be equally true that ‘glob sots tocs.” The corresponding result would 
hold, whatever the words or notation. However, some notation 1s neces- 
sary, and we are accustomed to the “decimal notation,” associated with 
the number ten, since we have ten fingers, and say our number system 
“has the base ten.” 

If we had twelve fingers, we would undoubtedly employ the “duo- 
decimal system,” where the base would be twelve. If we had just eight 
fingers, it would be the ‘‘octic system,” of base eight. 


We have, in fact, just ten different symbols: 0,1, ---,9, and make 
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the place where they occur important. For instance: 23 is understood to 
be 3 plus two tens and 576 is six, plus seven tens, plus five hundreds. Also 


769301 = 1+ 0-10 + 3-10? + 9-10* + 6.10 + 7-108. 


The zero can be considered to indicate absence of the corresponding power 
of ten. Using letters, we could write 


abcde = e + 10d + 10% + 10% + 104, 


where a,b,c,d,e are the digits of the number. (We call the symbols, 
0,1,2, --+,9 digits, since each can be associated with a finger of the hand, 
zero being a little finger.) 

From this it follows that if the last digit of a number is divisible by 
5, the number is also. Why? If the last digit is even, the whole number is 
divisible by 2, therefore, even. Why? If the last two digits form a num- 
ber divisible by 4, why is the number divisible by 47 Why cannot one 


tell from the last digit whether or not a number written in the decimal | 


system is divisible by 3? 


If our number system had the base three we would need only the 


symbols 0,1,2. Then abcde would mean 
e+ 3d + 850 + 3% + 3a, 


and the number would be divisible by 3 if e were zero, but we could not 
tell from the last digit whether or not it would be divisible by 2. 

In any system of notation the number for three would divide the 
number for six, but telling by inspection whether or not it is divisible 
would be easier in one notation than in another. 

In some respects the number system to the base two, the so-called 
binary system, is the simplest. It has only two symbols: 0 and 1. It is 
especially adapted to machine calculation, since 0 can correspond to the 
lack of a certain electric connection and 1 can correspond to its existence. 
The addition table is merely 


0+0=0,0+1=1,1+1=10, 
and the multiplication table would be the joy of every schoolboy: 
0-0 = 0,0-1=0,1-1=1. 
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An example of multiplication would then be 


1011010 
101 
1011010 
0000000 
1011010 _ 
111000010. 


The chief disadvantage of this notation is that it requires much 
writing for relatively small numbers. In the decimal notation the two 
numbers above would be 90 and 5, and the result, 450. To see this, notice 
that the digits in the binary form counting from the right indicate the 
presence (when 1 occurs) or absence (when 0 occurs) of the corresponding 
power of 2. Thus we can make a table for the numbers as follows: 


Number to the base two Ὡ" ὯΝ Ὁ Ὁ ee ae eS A Φ 
1011010 {ΘΑ ΕΙΣ 
101 Ὁ. 
111000010 1 1 δ 8.08, 9.4.8 


Τὸ find the decimal value of the first number we add the powers of 2 un- 
der which 1l’s occur: 2+8-+ 16+ 64 = 90. The decimal values of the 
other numbers may be found similarly. 

One may convert from the decimal system to the binary system in a 
like manner. Suppose we wish to write 49 in the binary system. The 
highest power of 2 less than 49 is 2°; hence 49 = 25+ 17. Then 17 = 
2‘-++ 1. Continuing in this fashion, we have 49 = 25 + 24+ 0- 2° +0-2?+ 
0.2 1, which, in the binary system, becomes 110001. 

A convenient method of converting from the decimal system to the 
binary is indicated by the following process: 


0 1 3 6 12 2 49 


We write 49, the number to be converted, at the right and divide by 2, 
writing the quotient, 24, to the left of 49, and the remainder below it. 
Then we divide 24 by 2, writing the quotient, 12, to its left and the re- 
mainder, 0, below it. We continue in this way and see that 49 is 110001 in 
the binary system. 
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One application of the binary notation is in the so-called Russian 
Peasant method of multiplication. This is a method by which the product 


of any pair of integers may be found by a succession of divisions and 
multiplications by two and one addition. 
The process for 375 and 39 is as follows: 


39 375 
19 750 
9 1500 
4* 3000 
2* 6000 
1 12000 
14625 


In the left column we divided by 2, disregarding remainders, and in the 
right column multiplied by 2. In the left column we placed asterisks after 


the even numbers and, in the addition, omitted the numbers to the right of 


the stars. The reason for this process may be seen by writing 39 in the 
binary system, namely, 100111. Writing these in a column with the first 
digit at the bottom, it is seen that the asterisks correspond to the two 


zeros. Thus, to multiply 375 by 100111 in the binary system, we multiply 


375 by 1, since the last digit for 39 is 1; multiply it by 2, since the next to 

the last digit for 39 is 1; multiply it by 4, since the third from the right 

digit of 39 is 1; we do not multiply it by 8 or 16, since those digits are ZeTO, 

_ we multiply it by 32. This is accomplished by the addition procoss 
ve. 


Exercises 1.9 | 


1. Multiply 75 by 423 by the Russian Peasant method. Convert these 


numbers to the binary system, multiply and check by converting back — 
to the decimal system. 


2. Four cards are constructed. The first, headed 1, has all the odd num- 


bers between 1 and 16; the second, headed 2, hus the numbers: 2,3,6,7,10, 
11,14,15; the third, headed 4, has the numbers: 4,5,6,7,12,13,14,15; the 


fourth, headed 8, has all the numbers from 8 to 15 inclusive. You ask 


someone to choose a number between 1 and 15 and to hand you the cards 
on which this number occurs. You then add the numbers at the tops of 
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those cards and tell him the number he chose. How would you number 
five cards to pick a number from 1 to 31? Why does it work? 


3, Suppose, instead of wanting to answer ‘‘yes” or “no,” it were to be 
one of “yes,” “no,” or “maybe.” What number system would you use 
instead of the binary system? What assumptions about “yes,” “no,” 
and “‘maybe” would have to be made? 


4. Translate 57 and 93 into the binary system, multiply them in that 
system, and check your result by converting into the decimal system. 


5, Four weights are to be chosen so that any object from 1 to 15 pounds 
(weighing a whole number of pounds) could be weighed by putting the 
object in one pan of a balance scale and balancing it with a proper assort- 
ment of the four weights in the other pan. What would they be? 


6. Show that if the weights above could be put in either pan, three 
weights would suffice for objects weighing less than 15 pounds. What 
would their weights be and how would the proper assortment be found 
for any object? 


7. Exercise 2 is connected with the base two. Could similar cards be 
constructed for a trick using the base three? 


8. Show that if the quotient when one divides a number by 2°, disregard- 
ing the remainder, is odd then 1 is the fourth digit from the right in the 
binary representation of the number, while if the quotient is even, the 
fourth digit will be 0. Generalize this for any power of 2. 


1.10. Rational numbers 


Though the theory of numbers is primarily concerned with integers, 
there are a number of situations in which rational numbers are involved. 
Furthermore, the place of the integers in the number system can better 
be appreciated if we see how rational numbers occur. 

Just as we introduced negative integers so that we could solve the 
equation ὃ + x = a, when a is greater than ὃ, so we determine a new kind 
of number, called a rational number, so that ba = a will be solvable (when 
α and ὃ are integers with ὃ τέ 0), having the property that 


(1) (a/b)b = b(a/b) = a. 


Ne ee 


| 
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This we call a “rational number,” since it is written as a “ratio.” When 


b = 0 we cannot consistently define this symbol, as we shall see later, 


Whenever we write the symbol (a/b), we assume b + 0. 


First, to define equality, consider the two rational numbers (a/b) — 


and (c/d), where bd ~ 0. From the definition, 
(a/b)b = a, (c/d)d = c. 
Then, if multiplication is to be well defined, we have 
(a/b)bd = ad, (c/d)db = ch; 


and, if (a/b) = (c/d), we must have ad = cb. On the other hand, if the 
cancellation property of multiplication is to hold, ad = cb must imply 
(a/b) = (c/d). We therefore make the following 

Definition of equality: The rational numbers (a/b) and (c/d) are said 
to be equal whenever 


(2) ad = be, bd τέ 0. 
An immediate consequence is the following 


Cancellation property of fractions: (a/b) = (ak/bk), k ¥ 0, since 


abk = bak. 
Since (a/1)1 = a = a- 1, we would define 


(3) (a/1) = a. 


Hence all the integers are rational numbers. This includes a = 0. s 
Since we want the distributive and associative properties for multi- 
plication to hold for rational numbers we need to have the f ollowing: 


{ (a/b) + (c/d)} - bd = (a/b) - bd + (c/d) - bd 
= (a/b)b- d+ (c/d)d-b 
= αὐ -ἰ οὗ, 


where the first equality assumes the distributive property for rational 
numbers, the second the associative property for rational numbers and 


the commutative property for integers, and the last equality results from 
the definition of a rational number. However, 


{ (a/b) + (c/d)} - bd = ad + cb, 
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which means that to be consistent we must define addition of rational 
numbers by 


(4) (a/b) + (c/d) = (ad + eb) /bd. 


Furthermore, we wish multiplication of rational numbers to be com- 
mutative and associative. 
Hence 


(a/b) - ὃ = a and (c/d)d = c should imply 
(a/b)b(c/d)d = (a/b) - (c/d)bd = ac. 


However, by our definition of rational numbers, (ac/bd)bd = ac. Hence 
we define multiplication so that 


(5) (a/b) (c/d) = (ae/bd). 


These five definitions suffice to prove that rational numbers satisfy 
all the properties of integers listed in Section 1.2. Also 


0 - (a/b) = (0/1)(a/b) = (0/b) = 0 = (a/b) - 0. 


Thus zero multiplied by any fraction gives zero, by definition. 

Now we can see why the exclusion of b = 0 in (a/b) is necessary if 
our rational numbers are to have the desired properties. Suppose we 
were to define (1/0) to be a number called “infinity,” and wrote 


(1/0) = ο, that is, 1 = 0- ὦ. 


Then, by the cancellation property of fractions, (2/0) = ~; that is, 
2=0-o, However, 


2(0- ©) =2-2=4and (2:0) ο =0- ὦ -- 2. 


Thus 4 + 2 shows that 2(0- οὐ # (2-0) «, and hence the associative 
property would not hold. This shows that if we were to call (1/0) a 
number it would be necessary to specify that it did not behave like the 
Other numbers. It is much easier to exclude it from the family of num- 
bers. We therefore say that (1/0) is not a number. In fact (a/0) is not 
&number. We do not even call it infinity, since that might be misleading. 

We call a rational number positive if it is equal to a/b, where a and b 
are positive integers. It is called negative if a and ὃ are integers of op- 
Posite sign (that is, one positive and one negative). Furthermore, if r 
and 5 are two rational numbers, we say that r « 8 if there is a positive 
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rational number wu such that r+ u=s. Subtraction of rational numbers 
is defined in the same manner as subtraction for integers. 


Exercises 1.10 


1. Show that the above definitions and the properties of integers imply 
that addition and multiplication of rational numbers are commutative 
and associative. 

2. Show that equality of fractions has the properties of equality listed in — 
Section 1.2. 


3. Suppose we define (a/b) = (c/d) whenever c = ka, ἃ = kb for k, a non- 
zero integer. Would the properties of equality then hold? If not, what 
would your answer be for k, a rational number? 


4. Using only the properties and definitions of the last section, show that 
a(1/b) = a/b. Point out what properties are used. 


5. If r= (a/b), s = (c/d), and 8 is not zero, define the symbol (r/s) to — 
have the property that 


(r/s)s = r. 
What must r/s be in terms of a,b,c,d? 
6. Show one property which would fail if we defined 
(a/b) + (c/d) = (α - οὐ (Ὁ - ὦ). 
7. Use Exercise 20 of Section 1.8 to show that there is no rational num- 
ber whose square is 2. 


8. What properties of integers listed in Sections 1.2 and 1.4 also hold for — 
rational numbers? 


9. Why does (0/6) 
numbers? 


10. Show that if a/b and c/d are two rational numbers where a,b,c,d are 
integers and b and d are positive then 


a/b > c/d if and only if ad > be. 


11. Show that if a/b and c/d are two rational numbers with ὃ and d posi- Ἷ 
tive and 0 « a/b <c/d then a/b < (a+c)/(b}+d) <c/d. Why does 


= 0 follow from the properties listed for rational — 
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this show that between any two distinct rational numbers there is always 
a third? Is this a property of integers? 

12. Let f(z) be a polynomial a,x" + @p12*! +--+ +a: +a whose 
coefficients are integers. Let r and s be two relatively prime integers. 
Prove that if f(r/s) = 0 then 8 divides a, and r divides a. Show that this 
implies that if a, = 1 all rational roots of f(z) = 0 are integers. 

13. Use the result of Exercise 12 to show 

a. There is no rational number whose square is 2. Compare Ex- 
ercise 7. 

b. The number ./2+ νῷ is not rational. Hrivr: Show that the 
given number is a root of an equation of the type described in 
Exercise 12. 

14. Suppose, for integers a,b,c,d, we make the following definitions for the 
symbol (a/b): 

a. (a/b) = (c/d) shall mean a = 6, ὃ = d. 

b. (a/b) + (c/d) = (a+c¢/b+d). 

ce. (a/b)(c/d) = (ac — bd/ad + be). 

Show that this system has all the properties listed at the end of Section 
1.2 for natural numbers. 


1.11. Decimals and real numbers 


As we all know, the decimal notation can be extended to represent 

certain fractions. For instance, we write 176.9304 to mean 
1-10? + 7-10' + 6-10° + 9-107? + 3-10* + 0-10* + 4.10.5. 
This means that only rational numbers which can be written in the form 
a/b, where b is a power of 10 and a is an integer, are exactly equal to a 
decimal. For instance, there is no decimal which is equal to 1/3, for if 1/3 
Were equal to a/10°, where a is an integer prime to 10, we would have 
3a = 10°, 

Which is impossible, since 3 does not divide 10. 

On the other hand, if our number system were to the base 3, 1/3 
Would be equal to a decimal,* whereas 1/10 could not. 


* This probably would be called a “trimal.” 
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However, the decimal may be used to approximate any rational 
number to any desired degree of accuracy; more precisely, for any positive 
rational number 7 and any small positive number e, we can find a decimal 
ἃ such that 0 <r—d<e. For instance, 


1,3 — 3 = 1/30, 1/3 — 33 = 1/300, ---, 1/3 — .38 --- 3 = (1/3)10~, 


where in the last decimal there are k threes. Thus by choosing k to be 6, 
we can make the decimal closer to 1/3 than 10~, and so forth. 

This process of approximation works well when we are concerned” 
with the decimal approximation of a rational number. However, if it is 


to be used to define new kinds of numbers, a slightly different point of 


view is necessary. If we write 
dy = 833 +++ 3, 
where there are k threes in the decimal, then the sequence 
di, de, dy, ““" 


may be said to define the number 1/3 in the sense that 1/3 is the smallest 


number which is not less than any d; of the sequence. Consider the se- 
quence of decimals: 


(1) 1, 1.4, 1.41, 1.414, 1.4142, 1.41421, ---. 


This is a sequence of rational numbers whose squares approach 2 from 
below. For instance, 1.414 is included because 1.414? is less than 2 and 
1.415? is greater than 2. For the next entry we could compute 1.4141*, 
1.4142, 1.4143? and find that 2 lies between the second and third, hence 
list the second. By continuing far enough in the sequence, we can find a 
number whose square is as close as we please to 2. 

Now, it is rather clear that the process just described yields a se- 
quence of numbers having the property that by continuing far enough we 
can find a number of the sequence as close as we please to V2. (To para- 
phrase a once popular song: “No matter what you name, I can get 
closer.”) This may be formally shown as follows. By our process, for 
any positive integer n, we find a number z, of the sequence having the 
property that - 


“ἂς 2)-ς(, + 1/10"). 
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Since all the numbers involved are rational, we may use their properties 
to see that 


2 — αἰ < (x, + 1/10")? — αὐ = 27,/10" + 1/10" < 5/10", 


since 2, is less than 2 and 1/10” < 1/10". This shows that by choosing n 
large enough we may make 2? as close as we please to 2. 

This sequence does not define a rational number in the sense that the 
decimal defines 1/3, for, as we now show, there is no smallest rational 
number greater than all the decimals of this sequence. To show this, 
suppose there were, and let it be designated by r. First, r? cannot be less 
than 2 because more than one number of the sequence (all of which are 
rational numbers) would have a square closer to 2 than r? and hence would 
be greater than r. Second, r? could not be greater than 2, for then one of 
the decimals 


2, 1.5, 1.42, 1.415, 1.4143, 1.41422, --- 


would be smaller than r and yet greater than all the decimals of (1); each 
such decimal of the above set is rational. This would contradict our as- 
sumption that r is the smallest. Finally, we have shown that γ2 = 2 is 
impossible if r is rational. Thus there are sequences of decimals of the 
form 


(2) Υ, 1.@1, 7.102, 7.010202, " 


where r is a natural number or 0 and each a; is a digit, which do not de- 
fine a rational number in the sense above. We then define a new kind of 
number called a real number as follows: 


Definition: With every sequence of decimals of the form (2) we as- 
sociate a real number R which, by definition, is considered to be so placed 
among the rational numbers that 


1. R is greater than or equal to each number of the sequence. 
2. If Κ 15 any rational number greater than or equal to every number 
of the sequence, then S > R. 


Thus in this sense 1/3 is the real number defined by the sequence 
.B, 88, 333, 3333, ..., 


since it is greater than each number of the sequence; and if S is any ra- 


34 The Development of the Number System 


tional number greater than or equal to each number of the sequence, 
S > 1/3. 

On the other hand, there is no rational number defined by the se- 
quence (1), and we designate by ν ὦ the number defined by that sequence. 
A real number which is not rational we call an irrational number. 


The term “real number” includes all rationals, for the decimal ex- 


pansion of a rational number defines in this way the rational number. It 
includes integers, since 5 can be written 5.0000 ---; and 2/5, since it is 


40000 ---. However, as we have seen, it also includes numbers which — 


are not rational. 


We have thus defined a real number to correspond with every se- — 


quence of decimals of the form (2). Is it possible for two sequences (2) 
to define the same real number? The answer is given in the following 
theorem which we shall prove. 


Theorem 1.11. Two distinct sequences (2) define the same real num- 


ber if and only if they are identical for the first k digits, k being some non- 


negative integer, and from that point on, one has the sequence of digits — 


a000 --- and the other has the sequence 0999 ---, where a = b+ 1. 

To simplify notation, first notice that if we have two decimal expan- 
sions, we may multiply or divide both by a large enough power of 10 to 
make the dissimilar portions begin immediately after the decimal point. 
(This is on the assumption that we may deal with real numbers as with 
rational ones.) Then, assuming the cancellation property for addition, 
the two numbers will be equal if and only if the dissimilar portions of their 
decimal expansions are equal. Thus we may consider our decimal ex- 
pansions to be . 

5 = αατάαςφῦς""") R= -bibobs. 


We may, without loss of generality, assume a; > δι. 

First, if a: = δι + 2, then the rational number c, where 106 = ὃ. +1, 
will be greater than all decimals in the sequence for R. Hence R < c. 
However, c is less than all decimals in the sequence for S, a fact which 
shows that c cannot be S. Thus this case cannot happen. 

Second, suppose a; = ὃ. +1. Then the rational number a,/10 will 
be less than all the numbers in the sequence for S unless all a; after the 


first are zero. On the other hand, if ὃς < 8, then the rational number — 


b;/10 + 9/100 will be greater than all the numbers in the sequence for R 
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and less than those for S. Notice that a,/10 is greater than all the num- 
pers of the sequence for S. Thus, if both sequences represent the same 
number, we must have S = a;/10 and b,=9. Repetition of this argu- 
ment yields the desired result. 

Since a decimal expansion can be obtained for =, it is a real number. 
Many equations have one or more real roots: χὸ — 5 = 0, χ' --,ἱ. -- ὃ -.0, 
and so forth. 

Negative real numbers can be defined as we defined negative integers. 
It can be shown that they obey all the laws of the rational numbers, but 
we omit this because it is rather difficult. 


1.12. Complex numbers 


Though many equations have real roots, some have none. There is 
no decimal expansion of a root of x*-+ 1 = 0, since for z real the least 
value of 2? is 0, and thus z7-+1> 1. There can be no decimal whose 
square plus 1 is closer to 0 than 1. However, mathematicians are insatia- 
ble, and they have created a number whose square is —1 and call it V/-1 
or i. By definition, 13 = —1. Any number of the form a+ δὲ where a 
and b are real numbers is called a complex number. If γέ 0, it is called 
imaginary; if a = 0 γέ ὃ, it is called pure imaginary. Furthermore, we 
define a+ bi = c+ di if a=c, ὃ =d, for a,b,c,d real, and we define ad- 
dition and multiplication by 


(a+ bi) + (c+ di) = (ate) + (ὃ  α)ὲ 
(a + δῦ (ὁ + di) = ac + adi + bei + bdi? = (ac — bd) + (ad + be)i. 


These numbers have all the fundamental properties of the rational num- 
bers. 

It is a remarkable fact that any polynomial in x with complex co- 
efficients is equal to a product of factors of the form x — c where c is a 
complex number, that is, any polynomial 


Ont” + On) + Ont"? + +++ Ἔ aX + Ao, Gn γέ 0, 
where a; are complex numbers, can be expressed in the form 
n(x τῶν C1) (x we C2) hes (x — Cn), 


where the c; are complex numbers. Hence we have reached the end of the 


a 
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road as far as solutions of equations are concerned; we need no new kinds 


of numbers to express the roots of any equation whose coefficients are © 


numbers of the kinds already discussed. 

To recapitulate: We started with the natural numbers and could not 
solve the equation a + z = b, when a and ὃ are natural numbers and a is 
greater than b. Hence we invented negative integers and then could 
solve all such equations where a and ὃ are integers, positive or negative. 
However, with integers alone, we could not solve ax = ὃ where ὃ is not a 
multiple of a. Thus we invented rational numbers so that every equation 
ax = Ὁ, where a and ὃ are integers and a ¥ 0, is solvable. As it happened, 
all such equations where a and ὃ are rational numbers are also solvable in 
rational numbers. 

However, z? — 2 = 0 is not solvable in rational numbers, a highly 
unsatisfactory state of affairs, which bothered the Greeks very much over 
two thousand years ago. Therefore, we invented the real numbers and 
saw that this equation had real roots. Since 2? + 1 = 0 has no real roots, 
we invented imaginary and complex numbers, and the process is complete, 
as indicated above. For this reason, the set of complex numbers is some- 
times called complete, since it includes real numbers, rational numbers, 
and, in fact, all the numbers of this chapter. 


Exercises 1.12 


1. Classify each of the following as being a natural number, integer, ra- 


tional number, irrational number, real number, complex number, and © 


imaginary number: 
5, —8, —5/4, V/2/V/8, 4, 6, roots of χ᾽ — 7 = 0, roots of 22 + 5 = 0, 
m, sin 45°, 
2. Are the sum and product of two rational numbers necessarily rational? 
Are the sum and product of two irrational numbers necessarily irrational? 


3. Is the sum of a rational number and an irrational number ever ra- 
tional? Is the product of a rational number and an irrational number ever 
rational? 


4. If a+ br =c+dr, where a,b,c, and d are rational numbers and r ir- — 


rational, show that a = c and ὃ = d. 
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5. Answer Questions 2, 3, 4 with “rational” and “irrational” replaced by 
“real” and “imaginary.” 

ὁ. Suppose we defined a + δὲ > c+ di ifa >cora=cand ὃ >d, with 
a,b,c,d real numbers. If x,y, and z are three complex numbers, which of 
the following properties of inequalities would hold? 


a. & > Yy, y > z implies x > z. 
b, 2 > 0, y > z implies zy > zz. 


7. Show that there is a complex number satisfying the equation 27 +7 = 0 
wherez = V—1. 
8. Find the decimal expansions for each of the following rational numbers, 
and notice what properties they have in common: 

1/3, 1/5, 1/7, 1/11, 1/18, 2/7, 3/7, 11/7, 1/17. 
Can you prove any of these properties? Keep these results on hand for 
reference in the following chapter. 
9. If the symbol (a/b) in Exercise 14 of Section 1.10 is identified with 
a+ bi, show that 


a. (a/b) + (c/d) as defined is identified with (a + bi) + (c+ di). 
b. (a/b) (c/d) is identified with (a + bz)(c+ dz). 


sis 


Repeating Decimals and Congruences 


2.1. Repeating decimals 


We shall begin by exploring certain interesting properties of decimals, 
and in the process develop certain fundamentals of the theory of numbers — 
which not only serve to explain these properties but are also useful in 
many other connections. A repeating or periodic decimal (Ref. 1) is one ~ 
in which the same sequence of digits recurs and continues to recur from a — 
certain point on. Some examples are 


1212121212 --- ;3.4156156156 --- ; 2.30000 --- , 


where we have underlined the recurring portion which we shall call the — 
repetend. Every repeating decimal may be expressed as the quotient of 
two integers, that is, a rational number. This may easily be seen from 
two examples: | 
Examp.e 1. To find the rational number n equal to 4.123123123 --- , 


see that 1000n = 4123.123123 --- and hence that 1000n — n = 4123.0 - 4 
= 4119. Hence 999n = 4119, and n = 4119/999. 
EXAMPLE 2, ἢ = 412.3412341234 --- . Then 10000n — n = 4123000, — 
which implies that n = 4123000/9999. What is the relationship between — 
the power of 10 by which n is multiplied and the repeating decimal? 
To prove formally that every repeating decimal may be expressed as 
the quotient of two integers, let N be any repeating decimal. By multi-— 
plying N by 10*, for k an integer (not necessarily positive), we may make — 
10*‘N, which we denote by M/, have the property that its repetend begins 
immediately after the decimal point. Thus we may write M in the form 
38 
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M =a+b-10-"+b-10-+b-10-"+---, 


where n is the number of digits in the repetend and ὃ is the repetend. 
Then 
105. M =a-10"°+6b+60-10"+0-10"+:::, 
M = a+b-10°*+6-:10"+--:. 


Subtraction gives 
(10" — 1)M = 10a +b -- α, 
and hence M is the rational number 
10°a +b -α b 
ia (ee 
Thus WN is 10. multiplied by the value above for M. 
Conversely, it is true that the decimal expansion of any rational 
number is repeating if, as above, we call a terminating decimal “repeat- 
ing” in the sense that 0 recurs in the expansion. See Exercise 2, below. 


A short table of repetends 

1/7 = .142857 --- 
1/13 = .076923 --- 
1/19 = .05263 15789 47368 421 --- 
1/17 = .05882 35294 11764 7 --- 
1/23 = .04347 82608 69565 21739 13 --- 
1/29 = .03448 27586 20689 65517 24137 931 --- 
1/31 = .03225 80645 16129 - -- 
1/37 = 027 --- 


Exercises 2.1 


1. Find the rational number equal to 31.2145145 --- . 


2. If in a process of division a remainder appears which appeared before, 
why must the expansion repeat from that point on? Why in dividing one 
integer by another must eventually a remainder appear which occurred 
before? How may this be used to show that any rational number has a 
repeating decimal? , 
3. Use the development in Exercise 2 above to show that the number of 
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digits in the repeating part of the decimal expansion of a/b cannot exceed 
b—1. 
4. Show that the repetend of the decimal expansion for 1/m for any in- 


teger m occurs between the first two times that the remainder is 1 in the 


division, if m is prime to 10. 

5. Show that the results of the above exercise hold also if 1 is replaced by 
r, ἃ positive integer less than m, and hence that any such decimal repeats 
from the beginning. 

6. Compare the decimal expansions of each of the following sets: 


1/3 , 1/13 , 1/28 ; 1/7, 1/17, 1/37; 
1/9, 1/19, 1/29. 


7. Each of the following letters represents a digit chosen from 0 to 9 in- 
clusive, and no two letters represent the same digit. A fraction, equal to 
a repeating decimal, is indicated. Find what values the letters must have: 


EVE 


Bip 7 ἸΑΙΚΤΑΙΚ..". 


2.2. Number of terms in the repetend of the repeating decimal 


The student’s experience with the previous exercises shows that when 
a remainder recurs, the decimal expansion repeats. For instance, in the 


division of 11 by 7, the first and seventh remainders are 4, and the repetend 
begins with the second digit of the expansion and ends with the seventh. 


In the expansion of 3/7, the first and seventh remainders are 2, and the 
repetend consists of the first six terms of the expansion. This must be so, 


in general, since what follows any point in the division depends only on — 


the remainder at that point. Furthermore, division by a number ὃ can 
yield at most b — 1 different remainders. Hence, if the first b — 1 remain- 
ders are distinct, the next one must have occurred before. This shows 


that the repetend of the decimal expansion of a/b can contain no more ~ 


than b — 1 digits. 
The above result can be seen more clearly if we notice that the first 


remainder in the decimal expansion of 1/7 is the remainder when 10 is — 
divided by 7, the second remainder when 10? is divided by 7, and the nth © 
remainder when 10” is divided by 7. Similarly, in the expansion of 11/7, 
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the successive remainders in the division process are the remainders of 
11, 11-10, 11-10*, 11-10%, and so forth, when divided by 7. There cannot 
be any more than six different remainders. 


2.3. Congruence notation 


There is a convenient notation due to Gauss (one of the greatest 
mathematicians of all times), which is of assistance in the above descrip- 
tion (Ref. 11, pp. 209 ff.). Instead of saying that 3 is the remainder 
when 10 is divided by 7, 2 when 10? is divided by 7, and the like, we write 


10 = 3 (mod 7), 105 = 2 (mod 7), 10 = 6 (mod 7). 


In general, 
a = ὃ (mod m) means a — b is divisible by m. 


It is read: α is congruent to b mod m (mod being short for ‘“modulo’’). 
Hence 100 is congruent not only to 2 (mod 7) but to 9, 16, 30, —5, etc. 

Thus we could express the results in the last part of Section 2 as 
follows: The remainders in the division of 1 by 7 are the numbers congru- 
ent, to 10, 105, 105, and so forth (mod 7); the remainders in the division of 
11 by 7 are the numbers congruent to 11 times 10, 105, 105, and so forth 
(mod 7). In general, if we divide a by b, the remainders in the process are 
those numbers congruent to α times the powers of 10 (mod δ). 

Another meaning which could be ascribed to the symbol 

a = b (mod m) 
would be as follows: The remainders when a and ὃ are divided by m are 
the same. That this is equivalent to the former meaning can be shown 
as follows. Suppose a = mq +7, ὃ = ms + ft, where r and / are nonnegative 
integers less than m; that is, r and ¢ are the remainders when a and ὃ are 
divided by m. Then α -- ὃ -- m(q—s) =r-—1#, and, if a — ὃ is divisible 
by m, then r — ὁ is also; and since r and ¢ are less than m, 
lr — εἰ « m, 

which * shows that r — i= Oorr =¢. On the other hand, if r = ἐ, we would 
have a — ὃ = mq — ms, which is divisible by m. 


* The absolute value of a number ὃ is written [δ and means the number —6 if ὃ is 
hegative, and -+-b if b is positive or zero. 
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In brief, we have just shown that if the difference of two numbers is 
divisible by m their remainders when divided by m are the same 
conversely. 


4110) 
΄ νη 


Exercises 2.3 


1. Use the congruence notation to find the first three remainders in th 
division of 11 by 21. 


2. Prove that if a-10" = α 10" (mod 7), where r > 8, thena:10"* =@ 
(mod 7). What does this show about the repetition in the expansion of 
a/7? 
3. If the number 7 in Exercise 2 were replaced by 11, would the same 
result hold? What if 7 were replaced by 15? What general conclusion ns. 
could you draw for 7 replaced by any integer δὴ 


4. Which of the following properties of congruences hold? Specify ar 
exceptions. 


. αὶ = ὃ (mod m) implies ac = be (mod m). 
. ar = br (mod m) implies a = ὃ (mod m). (Compare Exercise 8} ᾿ 
. a = ὃ (mod m) implies a+ ὁ = ὃ -Ῥ (mod ηη). 

. a = ὃ (mod m), c = d (mod m) implies ac = bd (mod m). 


ac oO 


5. If a = ὃ (mod m), prove that a* = δ" (mod m). Use this to find the 
remainder when 3 is divided by 10, that is, the last digit in 3. Whe 
is the last digit in 7”? 


2.4. Properties of congruences 


In Exercise 4 above you were asked to examine the truth of 


Theorem 2.4a. If a = ὃ (mod m), then ac = be (mod m). 

This is true, since a — ὃ divisible by m implies (a — b)c is divisible by 
m and hence ac — bc is also. 

You were also asked to test the truth of 


Theorem 2.40. If a = ὃ (mod m), thena+c=b-+c(modm). nis 
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is true, since a— ὃ = α -Ἐ ὁ -- (b+) implies that if one difference is di- 
visible by m the other is. 

On the other hand, ar = br (mod m) does not always imply that 
a = ὃ (mod m), as is shown by the fact that 6 = 27 (mod 21) but 2 49 
(mod 21). Suppose ar — br is divisible by m. Then (a — b)r is divisible 
by m; that is, m must be the product of two factors one of which divides r 
and the other a — ὃ. Let ἃ be the largest factor of m which divides τ; that 
is, let d be the g.c.d. of m and r. Then r(a— ὃ) divisible by m implies 
(a — b)(r/d) is divisible by m/d. However, r/d and m/d have no common 
factor greater than 1, since if g were such a factor, m and r would have the 
common factor dq which is larger than d. Hence m/d divides a — ὃ, and 
we have proved 


Theorem 2.4c. If ar = br (mod m) and dis the g.c.d. of m and r, then 
a = b (mod m/d). 
We leave to the student the proof of 


Theorem 2.4d. If f(x) is a polynomial in x with integer coefficients, 
then a = ὃ (mod m) implies that f(a) = f(b) (mod m). 

Congruence is a kind of equality in that it has the properties of 
equality given in Section 1.2 (see Exercise 13 below). Theorems 2.4a and 
2.4b show that multiplication and addition are well defined relative to 
congruence. From this point of view congruence (mod m) can be thought 
of as a separation of all the integers into classes, each class containing all 
the numbers congruent to each other (mod m). For instance, if our mod- 
ulus is 3, the integers are divided into three classes as follows: 


Class of 0: -,—-9,-6,-3,0,3,6, 9, 
Class of 1: οὐ πϑι τι 2,1, 4,7, 10) +) 
Class of 2: oo —7,—-4,-1,2,5,8, 11, 


In the sense of congruence, any number of one class is “equal” to any 
other number of that class. From this point of view, adding 7 and 4 is the 
Same as adding 1 and 1 or —5 and —2, since all three sums are in the class 
of —1. Similarly, 7-8 = 2 (mod 3), since 1-2 = 2 (mod 8). | 
For ease of reference we exhibit below the addition and multiplication 
tables of integers mod 7. 


— ut ey ae μα 


SS eee ὯΝ 


——————— 
a ΄ἔ- Ὁ 
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Addition (mod 7) Multiplication (mod 7) 
ase 46 ὃ 1.2 δ 4-68 
112 3 46 6 OU 143 26 4 3 
27a 46 6 0 1 Aa & 8k “3 
Sin somo ft F sis 6 2°o 2 
ai6 6 0 123 3 44 1°6°2 & 
SiG 012 8 4 615 316 4 
Sin it 3:2 8 4°35 676° 6°4@°3 2 


Exercises 2.4 


1. Write out the addition and multiplication tables mod 11 and mod 12, 


2. What properties do the tables mod 7, mod 11, and mod 12 have? 
Can you prove any of them for a general modulus m? 
3. Use the tables to solve each of the following equations, when solutions 
exist, for ὃ = 7, 11, 12. 

a. 5 + «= 3 (mod ὃ). 


b. 5a = 13 (mod ὃ). 
c. 41 = 13 (mod δ). 


Where a congruence is not solvable, give reasons for the lack of solution. 


4. Judging from your experience in solving the equations above, what 
would you guess to be the condition in order that each of the following 
congruences be solvable? 


a.a+«=b(modm). 
b. ax = b (mod m). 


Go as far as you can in proving your conjecture. 
5. Given x = m (mod a), x = m (mod ὃ), show that (a,b) = 1 implies 
a =m (mod ab). What would your conclusion be if (a,b) = d > 1? 

ὁ. If ab = 1 (mod m) and be = c (mod m), show that x = ac (mod m) 
and hence a could be called the reciprocal of ὃ (mod πη). | 


7. Show that a,10" + a,_,10*'?+ «+: +a,10 + a, where the a,’s are 
integers, is congruent to a, + a,1+ +++ +a: + αὐ (mod 9) and hence 
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that any number is congruent (mod 9), to the sum of its digits. How 
would you express this without the use of the notion of congruence? 

8. How would you perform the following trick? You ask a friend to 
“think of a number” and then form another number by mixing the digits. 
He then is to subtract the smaller from the larger and tell you all but one 
of the digits in the difference. You then tell him the missing one. For 
instance, he may choose 57689; he scrambles the digits to get 98567 and 
subtracts, getting 40878. He tells you of the digits 0,4,8, and 8, and you 
inform him that the other one is 7. 


9. Find an expression in Gp, Gn—1, °** » 41,4 to which 


An10" + α,-1105} + +++ + 4:10 + a 


is congruent (mod 11). Is there for this result an analogue to the trick in 
Exercise 8? (For other tests of divisibility, see Ref. 9.) 

10. If N is a two-digit number and M the number obtained from N by 
interchanging the digits, show that N — M is divisible by 9. Find all such 
numbers N for which N — M = 27. 

11. Show that (x + y)? = x? + y” (mod p), where p is a prime number. 
12. Prove Theorem 2.4d. 

13. Prove that congruence has the four properties of equalities given in 
Section 1.2. 

14. Teams A and B contain six men each. How does the multiplication 
table (mod 7) give a plan by which each man of team A plays each man 
of team B, six matches being played simultaneously? 

15. Prove that a number which is a square must have one of the following 
as its last digit: 0,1,4,5,6,9. What can you say of the last two digits? 


2.5. Congruential equations 


It is easy to show that the congruence a + z = ὃ (mod m) is always 
Solvable, since b — a = x is a solution. We partially prove 


Theorem 2.5a. The congruence az = ὃ (mod ™m) is solvable if the 
&.c.d. of a and m is 1, and then there is only one solution (mod m). | 
We give the proof for m a prime number Ὁ and leave to the reader the 
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proof for m composite. Let α be an integer not divisible by p and writ 
the numbers 


(1) a, 2a, 3a, +++ , (Ὁ — 1)a, pa. 


No two of them are congruent (mod p), since ra = ya (mod p) would 
imply « = y (mod p) by Theorem 2.4c. However, there are p of them, 
and thus p classes represented. Hence, since there are only p classes 
(mod p), one of them must be the class of ὃ; that is, ὃ must be congruent 
(mod p) to one of the numbers (1). 

Notice that Theorem 2.5a implies that az = 1 (mod m) is solvable if 
and only if (am) = 1. When this congruence has a solution ὃ, then Ὁ is 
essentially the reciprocal of a, and to solve the congruence 


ax = ὁ (mod m) 


we need merely multiply by ὃ and have abr = x = be (mod m), as 
seen in Exercise 6 above. 
Another consequence of Theorem 2.5a is 


Theorem 2.5b. The congruence az = b (mod m) is solvable if and 
only if the g.c.d. of a and m divides ὃ. If it is solvable there are (a,m) 
distinct solutions. 

To prove this, suppose ar = b (mod m); then ar — b is divisible by m, 
and any factor common to a and m would have to divide b. Suppose 
(am) = g and g|b. Then az = ὃ (mod m) if and only if 


(a/g)x = b/g (mod m/q). 


Since (a/gn/g) = 1, this has just one solution (mod m/g), and if thi 
solution is 2%, then the solutions of the congruence az = ὃ (mod m) are 


» % + (g — 1)m/g. 


There are better ways of solving a congruential equation than that 
suggested by the proof, but it does supply a definite way of finding a solu- 
tion of such a congruence. The reader should apply this method to the 
solution of the congruences in Exercise 3b and 3c above. 

A similar method of proof may be used to give the following useful re- 
sult sometimes referred to as Fermat’s Little Theorem (Ref. 11, pp. 54 ff.). 


Xo, % + m/g, τὸ + 2m/g, "5" 


Theorem 2.5c. If a is not divisible by a prime number p, then 
a? = 1 (mod p). 
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To prove this, consider the set of numbers in (1), omitting pa from 

the list. The set is then congruent to 1,2,3,---, p—1 in some order, 

and hence the product of the numbers of the set is congruent to 
ῳ — 1)! (mod p). Thus we have 


a?-1(p — 1)! = (p — 1)! (mod p). 


Since (p — 1)! is not divisible by p, Theorem 2.46 shows us that we may 
divide by the factorial p — 1 to get our desired result. 

This theorem of Fermat establishes a property of the repeating part 
of the decimal expansion of 1/p when Ὁ is a prime number not dividing 10, 
for we have 


10° = 1 (mod p), 


which means that in the division of 1 by p, the (p — 1)" remainder is 1. 
Furthermore, the (p — 1)" remainder in the division of a/p is the same as 
the remainder when a is divided by p. 


Exercises 2.5 


1. Use the method of proof of Theorem 2.5a to solve the congruence 
5a = 3 (mod 11). 
2. Prove that the number of terms in the repetend of the decimal ex- 
pansion for a/p, where p does not divide a, is a divisor of p — 1, p being 
& prime number. 


3. Prove that for every prime number p, except 2 and 5, there is a 
multiple p whose every digit is 9. 

4. If (105 -- 1)/p is a positive integer, what is its last digit if p = 1 
(mod 10)? If p =3 (mod 10)? If p =7 (mod 10)? If p = 9 (mod 10)? 
5. Prove Fermat’s Theorem by showing that 


((,. 111 ὃ’ = (1+1+--: 


Where the number of 1’s is less than p. 
6. Prove that a? = a (mod 7) for p a prime number. 


7. Prove that if m? +n? = 0 (mod p) then m? + n? = 0 (mod p*) for p 
80. odd prime number. (Ref. 3, p. 50.) 


+ 1) (mod p), 
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8. State and prove the theorem corresponding to Fermat’s Little Theorem 
for a composite number m instead of p. (In the proof, replace the set 
1,2,3,---, p — 1 by the positive numbers less than m and prime to m.) 
What are the consequences of this theorem in connection with the decimal] 
expansion of a/m, where m is a composite number not divisible by 5 or 
Why is the last restriction necessary? 

9. Let αὶ and y each take the following (m + 1)/2 values: 

0, 1, 2, 3, ate (m — 1)/2, 

where m is an odd integer. If a is an integer, show that no two of the re 
sulting (m + 1)/2 values of « + a are equal. Since (m+ 1)/2+ (m+ 1)/2 
= m+ 1 and there are only m different numbers (mod m), show that at 
least one value of « + a is congruent to one value of y and hence that 


x+a=y (mod m) 
has a solution 2,y for which 0 < x < m/2 δὰ 0 < y « m/2. 
10, Using the same principle as in Exercise 9, show that 
ax = y (mod p) 
has a nontrivial solution in integers z and y less in absolute value than 
\/p for p, a prime not dividing a. 
11. Can Exercise 10 be extended for p, a composite number? 


12. Does α = b? (mod m) imply a = b (mod m) or a = —b (mod m)? 
What is your answer if m is a prime or a power of a prime? 


2.6. The Euler Phi-function 


An important number theoretic function is the so-called Euler Phi- 


function defined as follows: ¢(1) = 1 and, if m > 1, ¢(m) is defined to be 


the number of positive integers less than m and prime to m. For instance, 
φ(12) = 4, since 1,5,7,11 are the positive integers less than 12 and prime 


to 12. A theorem for composite numbers analogous to Theorem 2.56 is 


known as Euler’s Theorem and may be stated as follows: 


Theorem 2.6a. If (a,m) = 1, then a®” = 1 (mod πηι). 
To prove this, we let 


(1) αι, G2,*** , Qs, 


! 
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where t = 6(m), be the positive integers less than m and having no factors 
in common with m, in other words, the positive integers less than m and 
prime to m. If a is any number prime to m, as in the proof of Theorem 
2.5a, we see that 


G01, As, ee » AAs 


are all prime to m and no two are congruent (mod m). Hence they are 
congruent in some order to the numbers of (1). Taking the product of 
both sets, we have 


a'(ay@2 +++ αὐ = aid2**+* αι (mod m), 


and, dividing both sides by the product of the a’s, we have our theorem. 
The a’s in (1) constitute what we call a reduced residue system 
(mod m); in fact, a reduced residue system is any set of ¢(m) numbers 
congruent to the numbers of (1). Similarly, any set of m numbers in- 
congruent (mod m) we call a complete residue system (mod m). 
A generalization of the result in Exercise 2, above, is a direct conse- 
quence of 


Theorem 2.6b. If ais prime to m and r is the least positive exponent 
for which αὐ = 1 (mod m) and if at = 1 (mod m), then r divides 8. 

To prove this, suppose we write out the powers of a (mod m). The 
first r powers are distinct (mod m), since at = α" (mod m) with r > 1 > u 
would imply αὐ" = 1 (mod m) and ¢ — u would be positive and less than 
r, contrary to the assumption about r. 

Hence the powers of a may be written in rows as follows: 


a,@,a,°-°, a zl, 
art! grit... ΟἿ = 1, 
ar, grt? ..., git = 1, 


9 
where only the last power in each row is congruent to 1 (mod m). If, then, 
α' = 1 (mod m), it must occur at the end of one of the rows and hence 
must be an integral multiple of r. This proves our theorem. In view of 
Euler’s Theorem we have 


Corollary 2.6b. If r is the least positive exponent for which a" = 1_ 
(mod m), then r is a divisor of ¢(m). We call r the exponent to which a 
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belongs (mod m). This immediately implies that in any decimal expansion - 


of a/m, where m is prime to 10, the number of digits in the repetend is a 
divisor of #(m). 


2.7. Factorization by means of Euler’s Theorem 


Certain types of numbers may be factored easily by use of Euler’s 


Theorem and the properties of congruences. Suppose we seek the di- 
visors of ἡ = 5’— 1. From Theorem 2.60 we know that if r is the least 


exponent of 5 such that 5° = 1 (mod p), for any prime p dividing n, then 


r must divide 7 since 5’ = 1 (mod p). 


Now, r = 1 would imply 5— 1 =0 (mod p) and hence p= 2. In 
that case, 5’ — 1 = [([63)5ὅ -- 1 Ξε 18 .ὅ -- Ἰ Ξε -- 1 Ξε 4 (mod 8). Thus, 


if p γέ 2, r must be different from 1 and a divisor of 7, hence r = 7. Hence 


by Corollary 2.6b the only prime divisors of n beside 2 are primes p for — 


which 
p—1=0 (mod 7); 


that is, primes p = 1 (mod 7). However, p is odd and therefore must be 


1 more than an even multiple of 7; that is, 
p = 1 (mod 14), 


If n/4 has a prime factor p less than n/4 it must have one not greater than 


/n/2 because certainly p, a factor, implies that either p or n/4p is not — 
greater than /n/2. Thus the only possible primes which need to be tried — 


are those less than 139 and congruent to 1 (mod 14). These primes are 
29, 43, 71, 113, 127. 


Now 5’ = (—4)? - 5 = —64-5 = —30 = —1 (mod 29). Similar computa- 


tion shows that n is not divisible by 48, 71, or 118, and hence that n/4 is 


& prime number. 


A slight variation of method enables us to consider n = δῖ +1. 


A little computation shows that n = 2 (mod 4), n is divisible by 3 but not 
by 9 and, by the above, n = 0 (mod 29). Dividing n by 2-3-29 gives 
449, which is a prime number. 
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2.8. Further properties of repeating decimals 


Consider the repetend occurring in the decimal expansion of 1/7, 
namely, 142857. If we multiply this number by 3 we get 428571, by 2 we 
get 285714, andsoon. For every multiplier from 1 to 6 we get a so-called 
cyclic permutation of the digits of the original repetend; that is, if we 
write two repetends in succession: 

142857142857, 


multiplication by 3 gives us the number formed by the six digits beginning 
with the 4, multiplication by 2 gives the number composed of the six 
digits beginning with the 2, and so on. One may also accomplish the 
same result by writing the digits of the repetend in order around a circle; 
hence the term cyclic. Similarly, if we multiply the repetend of the ex- 
pansion of 1/17 by the numbers from 1 to 16, we would have the sixteen 
cyclic permutations of the digits in the repetend. Why is this so? Can 
the same conclusion be drawn for 19,23,13,31,37? If so, why; if not, 
what can be said? (For example, if we multiply the repetend for 1/19 by 
the numbers from 1 to 18, do we get the eighteen cyclic permutations?) 

There is another rather remarkable property of such repetends. For 
instance, the sixteen digits in the repetend of 1/17 are the digits from 0 to 
9, inclusive and, in addition, the six which occur in the repetend of 1/7. 
The repetend of 1/47 has 46 digits which consist of four sets of 0 to 9 and 
the six which occur in the repetend of 1/7. Suppose we have any prime p 
whose last digit is 7 and whose repetend N contains p — 1 digits. If we 
multiply N by the numbers from 1 to p — 1 in succession, each digit oc- 
curring in N will be transferred once and only once to the right end of the 
number. Hence the digits in N will be just those that occur at the right 
end of the numbers ΟΝ for c = 1,2, ---, p— 1. The last digit in N will be 
7 (see Exercise 4 of Section 2.5). If we multiply N by the numbers from 
1 to 10, inclusive, the last digits will be congruent to 7c (mod 10) for 
¢ = 1,2, +--+, 10 and hence will be the numbers 0,1, ---, 9 in some order. 
This will be true for each interval of 10 less than . This accounts for the 
sets of numbers 0 to9in N. The remaining six digits will be obtained by 
multiplying N by p—1, p— 2, » -- 3, Ὁ -- 4, p—5, Ὁ -- 6, whose last 
digits are 6,5,4,3,2,1, respectively; and N = 7 (mod 10) shows that the 
Products are congruent, respectively, to 2,5,8,1,4,7, which are the digits 
Occurring in the repetend of 1/7. 
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Recall that if the g.c.d. of ὃ and c is g we write (b,c) = g. And if 


(b,c) = 1, we call ὃ and ¢ relatively prime. There is a rather simple — 


formula for ¢(m) for any integer m which may be obtained easily once we 
have proved 


Theorem 2.9. If b and care relatively prime integers, ¢(be) = $(b)¢(c). 
To prove this, let b:,b2, --- ,b, be a reduced residue system (mod δ), 
and let ¢1,¢2, +--,¢, be a reduced set (mod c). 


The congruence 
ba + δι = c; (mod c) 
has a solution 21; for each c;, 7 = 1,2, ---,s, since (b,c) = 1. 
Thus 
(1) bay + δι, bay, + δι, brig + δι, τ, bai, + δι 


are all congruent to ὃ, (mod 6) and congruent in order to 61,02, +--+ , 
c,(mod c). 
The congruence 


ba + bs = 6; (mod c) 
has a solution x2; for each value of c; and the members of the set 
(2) bro + be, bt22 + bo, plata bao, + ὃ. 


are all congruent to ὃ: (mod δ) and congruent in order to ¢1,¢2, --+,C, 
(mod c). Thus for each ὃ, there will be a set of s numbers congruent to 
δ: (mod b) and congruent in order to ¢1,¢2, +++ ,¢, (modc). Each of these 
is prime to ὃ and ὁ and hence to bc; no two are congruent both (mod δ) 
and (mod c); and hence no two are congruent (mod bc), and there are rs 
of them. Furthermore, every number prime to bc must be congruent to 
one of the ὃ; (mod ὃ) and one of the c; (mod 6) and hence is one of the rs 
numbers. 

We have thus shown the existence of a set of rs numbers having the 
following properties: 


1. All are prime to be. 
2. No two are congruent (mod be). 
3. Every number prime to bc is congruent to one of the set. 


2.9. Properties of the Euler Phi-function 53 


Thus, each positive number less than be and prime to be is congruent 
(mod be) to just one of the set, and hence, there are exactly rs such num- 
bers. This completes the proof. 

Let m be expressed as a product of powers of distinct primes as 
follows: 


a, 


m= p,p,2 ++ p.’. 


Theorem 2.9 then shows us that ¢(m) = ¢(p,") $(p,”) --- (p-"), which, by 
Exercise 4 below, is 


"ΞΡ )@P=P2) Of =p) 


and which may be written ¢(m) = m(1 — 1/pi)(1 — 1/p.) «++ (1 — 1/p,). 


Exercises 2.9 


1. For what numbers m is ¢(m) odd? When is ¢(m) = m/2? 

2. Prove that if r is the least value of « for which a? = b? (mod m), where 
a and ὃ are prime to m, then αὐ = ὃν (mod m) implies that y is divisible 
by r. 

3. Prove that if r is the least value of x for which a? = —1 (mod m) and 
if αν = 1 (mod m) then 2r divides y. 

4. Prove that if p is a prime number, ¢(p") = ρ" — p™. 

5. Show that 2? — 1 is prime for p = 13 and composite for p = 11 and 23. 
6. Show that 2 + 1 is a prime number. 

7. If pisa prime whose last digit is 3 and if the repetend of 1/p contains 
p — 1 digits, what can be said about the digits in the repetend (see Section 
2.8)? What is the case for p whose last digit is 1 or 9? 

8. The repetend of 13 contains only 6 digits. Can the above results be 
extended to such a case? 

9. If αν,α:, --+,a; is a reduced residue system (mod m), show that bay, 
bas, --+,ba, is a reduced system if (m,b) = 1. 

10. Show that x” — x = 0 (mod 15-17-32) for all odd integers x. 

11. What is the largest number m you can find such that 29-2 =0 
(mod m) for all x? 
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12. Show that the number of positive fractions in lowest terms between 
0 and 1 and whose denominators are not greater than n is 


$(2) + (3) +++ -+4(n). 


13. Write the numbers bz + b; in the proof of Theorem 2.9 for ὃ = 5 and 


c= 3. 


14. If bi,bo, ---,b, constitute a reduced residue system (mod ὃ) and — 


€1,C2, ***,C, a reduced system (mod c), prove that every number d in the 


reduced system (mod bc) is congruent to some ὃς (mod δ) and some δὲ 


(mod c). Prove that any number d congruent to some ὃς (mod ὃ) and 


some 6; (mod 6) is in a reduced residue system (mod bec). How may this . 


be used to prove Theorem 2.9? 


15. Prove that the sum of the numbers of a reduced residue system 


(mod m) is divisible by m if m > 2. 
16. Answer the questions in the first paragraph of Section 2.8. 


17. If r is the least power of a such that α΄ = 1 (mod p) and (8,7) = 1, 


show that r is the least power of αὐ such that (α)" = 1 (mod p). 


2.10. Multiplicative functions 


The Euler ¢-function is called a multiplicative function, since (b,c) = 1 _ 
implies (bc) = $(b)¢(c). There are various number-theoretic functions — 


having this property. Two such functions are a(n), the sum of the divisors 


of n, and d(n), the number of divisors of n (see Exercises 9 and 10 after _ 


Section 2.13). 


Recalling that djn means “ἃ divides n,” we prove a general theorem — 


on multiplicative functions which we shall find is very fruitful. 
Theorem 2.10a. 11 f() is a multiplicative function and 
F(n) = df (ὦ, 


then F (n) is a multiplicative function. 


First of all, the notation should be explained. The quantity on the 


right is the summation of f(d) over all divisors ἃ of n. For instance, 


F(6) = 71) + f2) + 719) + £6), POR) = 7) +f), PR) = fA) + JB). — 
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Then F(2)F(3) = fF) ἘΣ) + MFC) + ΤΩ (3), which is equal 
to F(6) if f is multiplicative. To prove the theorem we assume (b,c) = 1 
and have 


F(bc) = ΣΙ = pz hada 


2S (r) ΣΙ = F(b)- F(c). 


The first equality is by definition; the second holds since any divisor of 
be must be the product of a divisor of ὃ, and a divisor of c, since (b,c) = 1. 
The third equality results from the multiplicative character of the func- 
tion f. This completes the proof. 
Now ΣΦ - 1+ φῴ) + o(p*) +--+ + φίρ"), since the only di- 
p* 


visors of p" are powers of p, p being a prime. This shows that 
2,0) Site δον πὶ Ste Pt t-te, 
which is a particular case of the following theorem. 
Theorem 2.10b. 29(d) =n. 
From Theorem 2.10a we know that >9(@) is a multiplicative func- 
tion, since the Euler Phi-function is multiplicative. Hence 
240) = £90 FeO ἐ oO: 


where n = p.1p,? +++ Ὁ“. However, each sum is equal to the power of Ὁ 
contained therein, which proves our theorem. 


2.11. Wilson's Theorem and an allied result 


There is an interesting theorem called Wilson’s Theorem (Ref. 11, 
pp. 259 ff.), which a trick helps us to prove easily. 


Theorem 2.1la. If pis a prime, (p — 1)! = —1 (mod p). 

First notice that the converse of this theorem is easy to prove; 
namely, (m — 1)! = —1 (mod m) implies that m is a prime, for otherwise 
(m — 1)! has a factor in common with m. Hence the theorem gives a 
hecessary and sufficient condition that m be a prime. See Exercise 12, 
below, 
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To prove the theorem, write out the set of numbers 
L, 2,0, ***, p= 2p — 1, 


and let the ‘‘mate” of 2 be that number a, of the set for which 2a, = 1 
(mod p). (We know there is such a number, since 2 is prime to p.) Con- 
versely, the mate of a, is 2. Then we can associate 3 with its mate, and 
so on. If a number ὃ were its own mate, we would have b? = 1 (mod p); 
that is, (6 — 1)(6+ 1) =0 (mod p), which implies that ὃ = 1 or p— 1. 
Hence omitting these two numbers, we have 


(p — 2)! = (2a2)(--)(--) “τ (1) (mod p), 


where each parenthesis contains two numbers whose product is congruent — 


to 1 (mod p). Hence we have 
(p — 1)! = 1(1)(1) --- (1)(p— 1) = -- (mod p). 
This is perhaps clarified by an example. Take p=11. Then 


2x = 1 (mod 11) has the solution 6, and hence 2 and 6 are mates. The 
mate of 3 is 4, and so on. Then we can write 


10! = 1(2-6)(8-4)(5-9)(7-8)(10) = 1-1-1-1-1-(—1) (mod 11). 
A similar trick results in a proof of 


Theorem 2.11b. If r = (p — 1)/2, then (r!)*(—1)" = —1 (mod p) if 
p is a prime. 

This is proved by pairing the numbers 1,2, ---,»— 1 as follows: 
1 and —1, whose product is —1; 2 and p — 2, whose product is congruent 
to —2*; 3 and p — 3, whose product is congruent to —3?, ---;(p—1)/2 
and (p+ 1)/2, whose product is congruent to —r*, since (p+ 1)/2= 
p—(p—1)/2. Hence (p— 1)! = (—1)(—2*)(—3*) --- (—r*) (mod 5): 
However, there are r minus signs, which shows that (p — 1)! = (—1)"(r!)? 
(mod p). 


2.12. Perfect numbers 


"There is an interesting result connected with the o function, where — 


o(n) is the sum of the positive divisors of n. The number 6 has the prop- 


erty that it is the sum of its divisors not including 6; that 15,6 = 1+2+3. — 
(The positive divisors of a number less than the number are called its — 
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aliquot parts or proper divisors.) Similarly, 28 = 1+2+4+7+ 14. 
Any number which is the sum of its aliquot parts, that is, half the sum of 
all its positive divisors, is called a perfect number. It has been shown 
(Ref. 13) that no odd perfect number contains less than five different 
prime factors, but no odd perfect number has ever been found. On the 
other hand, Euclid developed a formula for all even perfect numbers. 
This may be found as follows: Let n be a perfect number written in the 
form n = 2“ ἴηι, where m is odd and q is greater than 1. Then o(n) = 2n 
impli 
ee o(2™)o(m) = 2¢m; 
that is, 

; (2° — 1)o(m) = 2em. 
Since 24 does not divide 2¢ — 1, it must divide o(m), and we may write 
a(m) = 248, which yields 


(2¢—1)s = m. 


Certainly m has two divisors: 8 and (2*— 1)s, whose sum is 2%s, whereas 
we have just learned that o(m)=2%. Hence 8 and 2?—1 are the 
only divisors of m, a fact which shows that s=1 and 2?— 1 is a prime 
p. In that case, n is a perfect number. Hence the even perfect numbers 
are those which can be written in the form 


(1) 2. ἴ(29 — 1) with 2¢— 1 a prime number. 


In fact, g must itself be a prime number, since if g = ab, 2¢ — 1 would have 
25 — 1 as a factor. 

However, q may be a prime number without 2% — 1 being a prime 
number. For example, 2"! — 1 is composite (see Exercise 5 in Section 2.9). 
The numbers 25 — 1 with q a prime are called Mersenne numbers, since 
it was Mersenne who conjectured that for g S 257 only the following 
values of g yield prime values for 2¢ — 1: 


2, 3, 5, 7, 13, 17, 19, 31, 67, 127, 257. 


The fact that Mersenne made five errors, namely, g = 67,257 yield com- 
posite numbers, and g = 61,89,107 yield primes, has been shown (Bulletin 
of the American Mathematical Society [1948], Vol. 49, pp. 378-380). 
Recently five more Mersenne numbers have been found with the aid of 
high-speed calculating machines, namely, for g = 521,607,1279,2203,2281. 
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(See two articles by Horace 8. Uhler, Vol. 18 [1952] and Vol. 19 [1953] 
of Scripia Mathematica.) 

The term “perfect number” had its origin in the mystical meanings 
of numbers. Amicable numbers are similarly defined. (Ref. 11, pp. 
96-100 and Ref. 14, p. 83.) 


2.13. Fermat numbers 


It is known that regular polygons with 2,4,8,16, --- sides may be 
constructed with ruler and compasses when the former is used only to 
draw straight lines and the latter only to draw arcs. The regular pentagon 
may also be constructed with ruler and compasses. In fact, it can be 
proved (Ref. 16) that the regular polygons which can be so constructed | 
are those of n sides, where 


n = QWpypo +++ Dry 


with & a nonnegative integer and with the p’s distinct primes such that 
p — 1 is a power of 2. Thus the polygons of less than 20 sides which can | 
be constructed with ruler and compasses are those having 


8, 4, 5, 6, 8, 10, 12, 15, 16, 17 


sides. If - 2+ 1 is a prime, it may be shown (why?) that r is a power 
of 2 and the numbers Ἷ 


οὐ 44 


for n a nonnegative integer are called Fermat numbers. Fermat conjec- 
tured that all such numbers are primes. For n = 0,1,2,3,4 we have the 
respective primes 


3,5,17 , 257 , 65537. 


However, Fermat’s conjecture is false. In fact, no other values of n are \ 
known which yield primes. It has been shown that Fermat numbers are 
composite for the following values of n: 


§,6,7,8,9,11,12,18 , 23,36, 38, 78. 


(Ref. 6, pp. 14, 15.) For example, 2* + 1 = 641 - 6700417. 
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Exercises 2.13 


1. If 2°+ 1 is a prime, show that r is a power of 2. 


2. If ris defined as in Theorem 2.11b, prove that (r!)? = —1 (mod 7p) if 
p =1 (mod 4) and that (r!)? = 1 (mod p) if p=-—1 (mod 4). The 
former congruence proves that 2? = —1 (mod p) is solvable if p = 1 
(mod 4). 

3. Prove that if p is a prime, r = (p — 1)/2 andr! = —1 (mod p), then 
p = —1 (mod 4). Is the converse of this statement true? 

4. Suppose in the proof of Theorem 2.118, the numbers 1,2,---,p-1 
were paired so that the product of each pair is congruent to —1. Show 
that if z? = —1 (mod p) has no solution, then 


(p — 1)! = (—1)" (mod p); 
that if 2? = —1 (mod 7p) has a solution, then 
(p — 1)! = (—1) (mod p); 


and hence in the respective cases, r is odd or even. 


5. Use the results of Exercise 4 to show that if 22 = —1 (mod p) is solva-~ 
ble then p = 1 (mod 4). 


6. Compute the first four even perfect numbers. 


7. The first four even perfect numbers end alternately in 6 and 8. Does 
this property hold in general? If not, is it at least true that every even 
perfect number ends in either 6 or 8? 


8. Since an equilateral triangle and a regular pentagon may be con- 
structed with ruler and compasses, we can divide the circumference of a 
circle into 3 equal parts and 5 equal parts by such construction. Show 
how this may be used to divide the circumference into 15 equal parts 
Without using the result on page 58. Generalize this result to show that if 
regular polygons of a and ὃ sides, respectively, may be constructed, where 
(a,b) = 1, then a regular polygon of ab sides may be constructed. 

9. Prove that o(n), the sum of the divisors οἵ n, is ἃ multiplicative func- 
τ Find o(p*) and use these results to get formulas for a(n) and 

o(d), 


dln 
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10. Prove that d(n), the number of divisors of n, is a multiplicative func- 
tion. Find d(p*) and use these results to get formulas for d(n) and 
2 An), where x is a given integer. 


11. Is the product of the divisors of n a multiplicative function? 
12. Show that if (m— 1)! = —1 (mod m) then m is a prime number. 
For what numbers m is (m — 1)! = 0 (mod m)? 


2.14. The Chinese Remainder Theorem 


There is an old puzzle attributed to Sun-Tsu in the first century 
A.D. which may be written as follows: Find the least two positive integers 
having the remainders 2,3,2 when divided by 3, 5, and 7, respectively. 
Let us see how this puzzle may be solved. All numbers having the re- 
mainder 2 when divided by 3 will be of the form 3n + 2 for some integer 
n. If 3n+2 =3 (mod 5), it will have the remainder 3 when divided by 
5. Then 3n = 1 (mod 5), which implies that n = 2 (mod 5) orn = 2 + 5z; 
that is, 3n-+2=8-+ 152. This gives all numbers which have the re- 
quired remainders (mod 3) and (mod 5). We want 8+ 15z to be con- 
gruent to 2 (mod 7); that is, 8+ 15% = 2 (mod 7), or x = 1 (mod 7). 
This gives ὦ = 1+ 7y and 3n +2 = 23+ 105y. Thus the two smallest 
positive numbers satisfying the conditions are 23 and 128. Notice that 
we began with a solution for 3 and got one for 5, thus one for 15; our last 
step gave a solution mod 3-5-7. That this process always works if the 
successive moduli are relatively prime is shown by 


Theorem 2.14. If (b,c) = 1 andr and 8 are any two numbers, there 
is a number x = r (mod δ) and = 8 (mod ο). 

' To prove this, notice that x = r (mod b) if and only if z is of the form 
r + δὲ for an integer ¢. Then we may determine ¢ so that r + bt = s (mod 
c), since ὃ and ¢ are relatively prime. 

Notice that if x and y are two integers which satisfy the conditions 
of Theorem 2.14, their difference is divisible by ὃ and by c and hence by 
bc. Furthermore, if x is a solution, x — tbc is a solution for any ὁ. This 
yields the 


Corollary 2.14a. There is just one nonnegative integer x less than 
be satisfying the congruences of the theorem, and all solutions are con- 
gruent to x (mod bce). 


2.14. The Chinese Remainder Theorem 61 


Corollary 2.14b. If m,ms, ---,m, are relatively prime in pairs and 
01,42, ***,@, are any 7 integers, the congruences below have a common 
solution: 


% = a; (mod m), = ας (mod mz»), ---, 2 = a, (mod m,). 


Corollary 2.14b often goes by the name of the Chinese Remainder Theo- 
rem, since such problems have been found in ancient Chinese literature. 
In fact much mathematical development in China in connection with the 
theory-of numbers antedated by centuries corresponding mathematics in 
Europe. (Ref. 4, Vol. 2, p. 57.) 


Exercises 2.14 


1. A woman with a basket of eggs was knocked down by a bicycle. In 
presenting her bill to the rider’s father, she said she did not know how 
many eggs she had but when she counted them two at a time there was 
one egg left, and similarly when she counted them three, four, five, and 
six at a time; but in sevens there were not any left over. What is the 
smallest number of eggs she could have had? 


2. If any or all of the following sets of congruences have a solution, find 
the solution : 


a. x = 3 (mod 6), « = 1 (mod 7), x = 2 (mod 18). 
b. # = 2 (mod 10), c = 7 (mod 15), x = —4 (mod 6). 
c. « = 3 (mod 4), « = 5 (mod 7), « = 7 (mod 9). 


3. Describe a method of finding solutions of a set of congruences in which 
the moduli are not relatively prime. 


Diophantine Equations 


3.1. Introduction 


A typical puzzle problem is the following: A man bought a number of | 


cows at $70 apiece and pigs at $50 apiece. He spent, in all, $530. How 
much of each did he buy? If c is the number of cows and p the number 
of pigs, we have the equation 


70¢ + 50p = 530, 
which is equivalent to the equation 
7c + 5p = 58. 
Now, if we could have fractional values of the unknowns, we could assign 
any value to p and solve for c. However, the values of p and c must be 
integers. Any equation whose solutions are restricted to integers is called 
a Diophantine Equation, after Diophantos (Ref. 11, p. 179). In this case, 
there is a further restriction that the solutions must not only be integers 


but cannot be negative. It can be solved by trial if we notice that we need ὦ 


to find a ὁ such that 53 — 7c is divisible by 5. If we try in order c = 1,2,3,4 


we find that the last is the first acceptable value, the values c = 5,6,7 are 


not acceptable and any larger value of c would make p negative. Thus 
the only solution which fits the problem is c = 4, Ὁ = 5. 


- There is a better way to solve the problem. Notice that to make — 


53 — 7c divisible by 5 is to solve the congruence 53 = 7c (mod 5). This 
is equivalent to solving 3 = 2c (mod 5); that is, 8 = 2c (mod 5), which 


has the solution c = 4 (mod 5). Hence all values of c must be of the form _ 


4+ ὅπ. Now n must be nonnegative if ὁ is, and it cannot be as great as 
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1 if pis positive. In fact, if we replace c by 4 + 5n in the equation we have 
28 + 35n + 5p = 53, or 35n + 5p = 25; that is, 7n+p=5. So our 
general solution is c = 4+ 5n, p = 5 — 7n for integer values of n. This 
shows that n = 0 if c and p are positive. It is also possible to begin by 
solving the congruence 53 = 5p (mod 7), but this is longer. 


Exercises 3.1 


1. Solve the following Diophantine Equations, indicating all solutions 
in integers: 

a. 5a + lly = 92. 

b. 102 + 14y = 39. 

6. 202 + 17y = 93. 


2. Aman received a check for a certain amount of money, but on cashing 
it the cashier mistook the number of dollars for the number of cents and 
conversely. Not noticing this, the man then spent 68 cents and discovered 
to his amazement that he had twice as much money as the check was 
originally drawn for. Determine the amount of money for which the 
check must have been written. 


3. Under what conditions on a, ὃ, and c will ax + by = c have one or 
more solutions in integers? Supposing that x and y is one pair of 
solutions, what formulas will include all solutions? 


3.2. A method of solution of a linear Diophantine Equation in 
two unknowns 


An equation az + by = c is called linear, since it contains no terms of 
higher degree than 1 in x and y. Solving this equation is equivalent to 
solving the congruence at = c (mod ὃ). This is solvable if and only if the 
g.c.d. of a and ὃ divides c, from Theorem 2.5a. If the g.c.d. of a and ὃ 
divides c, we can divide all three letters by the g.c.d. and have an equation 
in which a and ὃ are relatively prime. Then if 2 is one solution of the 
congruence az = c (mod δ) with (a,b) = 1, all solutions are of the form 
% + bn, since az = ay (mod ὃ) implies x = y (mod b). Putting this in 
the equation gives a(%+bn)+by=c; that is, by = c— ax — abn 
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However, a2 + byo = c implies that ὁ — ax = by, and by = by) — abn 
implies y = 7% — an. Thus we have 


Theorem 3.2. The Diophantine Equation az + by = c has one or 
more solutions if and only if the g.c.d. of a and ὃ divides c. If a and ὃ are 
relatively prime and 2,7) is one solution, all solutions are given by 


x = το + bn, y = yo — an 


for integer values of n. 

In this connection, geometrical considerations are of interest and 
value. The equation az + by = c represents a line in analytic geometry. 
The solutions in integers x and y will correspond to the points whose co- 
ordinates are integers and through which the line passes. We call any 
point whose coordinates are integers a lattice point. Thus, from the dis- 
cussion above, there are no lattice points on the line 3x + 6y = 1 and, 
from the previous section, only one lattice point in the first quadrant on 
the line 7x + 5y = 53. 

There are a number of ways of solving a linear congruence. We shall 
see in the next chapter that it may be done by means of continued frac- 
tions. Probably the best method in a numerical case for the solution of 
ax = Ὁ (mod m) aims at each stage to find a number congruent to ὃ 
(mod m) which has a factor in common with a and hence which can be 
divided into both sides of the congruence. Thus we can reduce the given 
congruence to one with smaller modulus and apply the method again and 
again to congruences with smaller and smaller moduli. We illustrate the 
method by solving 


19x = 73 (mod 125). 


1. We seek a multiple of 125 which, when added to 73, gives a num- 
ber divisible by 19. That is, we need a y such that 73 + 125y = 0 (mod 
19). Since 73 is congruent to —3, and 125 to —8 (taking the smallest 
residues in absolute value), the last congruence becomes —3 — 8y = 0 
(mod 19), or 8y = —3 (mod 19). 

_2. We may see by inspection or use the same method to find —3 = 16 
(mod 19); that is, 8y = 16 (mod 19), which yields the solution y = 2. 

This is the end of our “‘chain” of congruences, and, substituting 
y = 2 into 73 + 125y, we get 323, which is 19-17. This shows that 17 is 
a solution of the given congruence. 
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This method may be used to prove the existence of solutions (see 
Exercise 14, below). It should be noticed that a Diophantine Equation 
7x + lly = 32 can be reduced to a congruence (mod 11) or (mod 7) as 
shown above. 


Exercises 3.2 


1. Solve each congruence, 69z = 21 (mod 96), 1452 = 35 (mod 625). 


2. What is the least positive value of ὁ for which the line 15x + 2ly = c 
has lattice points? What is the corresponding general result for the equa- 
tion ax + by = οἵ 


3. Find all positive solutions of the Diophantine Equation 232 + 37y = 
212. 


4. How far apart are the lattice points on the line 7x + 5y = 538? 7x + 
5y = 1? If aand ὃ are relatively prime, how far apart will be the lattice 
points on the line az + by = οἵ 


5. Use the results of Exercise 4 to show that if a,b, and c are positive, 
(a,b) = 1 and ab <c, then the line ax + by = c has at least one lattice 
point in the first quadrant not on either axis. What does this imply about 
the positive solutions? 


6. If (a,b) = 1, show that the line az + by = c has infinitely many lat- 
tice points in the first quadrant if ab < 0, finitely many or none if ab > 0 
with ac > 0, and none if ab > 0 with ac < 0. 


7. If (a,b) = 1, and a,b, and ¢ are positive, show that there are (a + 1) 
(b+ 1) lattice points within and on the rectangle whose vertices are 
(0,0), (6,0), (0,2), (6,2). Hence show that within but not on the triangle 
bounded by the axes and the line az + by = ab there are exactly (a — 1) 
(6 — 1)/2 lattice points, thus that there are exactly (a — 1)(b — 1)/2 
positive values of c less than ab for which ax + by = c has positive solu- 
tions. 


8. Find the least integer ὃ for which the equation 52 + 7y = b has ex- 
actly six positive solutions. 


9. A customer buys an article for 48 cents. He has a $1 bill and 3 pennies, 
while the shopkeeper has 6 dimes and 7 nickels. In how many ways can 
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the change be arranged? (Ref. 14, p. 65; see also Exercises 11 and 12 in 
the same set.) 


10. A typical problem in elementary algebra is the following: Each οὗ 


two trains ran 180 miles, one at a speed of 6 miles per hour faster than the 


other. If the faster train required one hour less time find the speed οὖ 


each. Suppose 6 is replaced by a and 180 by ad, find the speed in terms of 


aandd. Show that for integral a and d, the speed is an integer if and only ~ 


if ἃ = ὃ + ¢ for some integer ¢. 
11. Show that the number of nonnegative solutions of the equation 


rt+2y+3z=n 


is the integer nearest to (n + 3)2/12. (Ref. 14, p.66.) Graphical methods _ 


may be used. 

12. How may Euler’s Theorem be used to show the existence of a solu- 
tion of the congruence az = ὃ (mod m) when (a,m) = 1? Would you 
consider this a practical method of numerical solution? 


13. Show that ax + by+cz=d has integer solutions if the g.c.d. of 
a,b,c divides d. 


14. How may the method used in the numerical example of this section — 
be used to show the existence of a solution of ax = ὃ (mod m) when — 


(a,m) = 1? 
15. Prove the first part of Theorem 3.2, using Theorem 1.8d. 


3.3. Linear Diophantine Equations in more than two unknowns 


One might guess, as a generalization of results for an equation in two 
unknowns, the truth of the following: 


Theorem 3.3. The Diophantine Equation 
G10 + Geta t+ +++ + Anta = ἃ 
is solvable if and only if the g.c.d. of the a,’s divides d. 


‘We prove this for n = 3 and leave to the reader the proof by induction — 


for any n. Suppose g is the g.c.d. of a; and a2. Then, no matter what in- 
teger y is, we can, from Theorem 3.2, solve 


QyX1 αὐ = Gy. 


-— 
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Thus the solution for n = 3 is reduced to finding integers y and z; such 
that gy + asx; = d. This has a solution if and only if the g.c.d. of g and a; 
divides d. However, the g.c.d. of g and a; is the same as the g.c.d. of 
αι, @2, @3, since any divisor of g and a; divides a; and a2, while any common 
divisor of a;,a2 and a; divides both ganda;. This establishes our theorem. 

The method of the proof may be used to find a solution in a numerical 
case as may be illustrated as follows: To solve 


6x + 10y + 152 = 37, 


notice that 6a + 10y = 2¢is solvable for any ¢ and a solution of 2ὲ + 152 = 
37 ist = 11,2 = 1. Hence the general solution of the latter ist = 11 — 15u, 
z=1+2u. Then 6x + 10y = 2¢ implies 32 + ὅν = 11 — l5u or 32+ 
δίῳ + 3u) = 11, and the general solution is 


z=2+ 5v,y+ 3u=1-—30,2= 1 - 2u. 


Though the above example showed a solution when each pair of co- 
efficients had a common factor greater than 1, the final solution was 
somewhat special, since the coefficient of y divided that of wu in 82 + ὅν = 
11 — 5u. In general it would be considerably longer. 

There is another method of solution whose motivation is not quite 
so clear but which gives results very expeditiously. The fundamental idea 
is this: Suppose the g.c.d. of a,b,c is 1. Then if we can find a’,b’,c’ and 
a’’b’’ c’’ so that the determinant 
ed ae - 
δ΄ YS 
a” δ cl’ 
is 1, we shall set up the equations 

az+ by+cz=2', 


avet+by+cz=y', 
a2 + by + cz = κ΄. 


Since the determinant of the coefficients is 1, these may be solved for in- 
tegers z,y, and z in terms of 2’,y’, and 2’; setting x’ = d and considering 
y’ and z’ arbitrary parameters, we have a general solution. The chief 
difficulty, then, is to find the second and third rows of the determinant. 
This may be done as follows: Let g = (a,b) and as—br=g. Then 
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(a,b,c) = 1 implies (g,c) = 1, and there are integers u and v such that 


gu—cv=1. Set up the determinant 


a b 
Tr 8 ° 
av/g bv/g 


Expanding by the last column, we have c(rbv/g — sav/g) + u(as — rb) = 
cv(rb — sa)/g + u(as — rb) = —cv + gu = 1. 
Let us see how this goes for a numerical example. To solve 
8x + 10y + 32 = 34, 


notice that g = (8,10) = 2 ands = —1,r = —1 isa solution of 88 — 10r = 2, 
while u = 2, v = 1 isa solution of 2u — 3v = 1. Thus we have 


82 + 10y+3z= 2’, 
=r ae Ξε ψ', 
4. - 5y+2z=2’. 


Solving for ,y, and z gives us the equations 


- —2e'— By’ + 32’, 
y= 2: τάν — 32’, 
z2= —g’ + 22’, 


and, putting x’ = 34, we have the general solution in terms of the param- 
eters ψ' and 2’. 

Another type of puzzle problem leads to two linear equations in three 
unknowns: A farmer with $100 goes to market to buy a hundred head of 
stock. Prices were as follows: calves, $10 each; pigs, $3 each; chickens, 
$.50 each. He gets exactly one hundred head for his $100. How many of 
each does he buy? If we let c be the number of calves, p the number of 
pigs and f the number of chickens, we have the two equations: 


10c + 3p + $f = 100, 
c+ p+ f= 100. 


If we subtract the second from twice the first, we have the equation 
106 + 5p = 100, 


and 100 = 19¢ (mod 5) implies ὁ = 5u and hence p = 20 — 19u. Sub- 
stitution in the second given equation gives f= 80+ 14u. Since the 
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value of c must be nonnegative, ὦ must be nonnegative, and p nonnegative 
implies τὸ S 1. Hence the only permissible answers are 


0, 20, 80 for τ = 0, and 5, 1, 94 for u = 1. 


If there is to be at least one of each kind of stock, the second is the only 
answer, 


Exercises 3.3 


1. Three chickens and one duck sold for as much as two geese; one chicken, 
two ducks, and three geese were sold together for 25 shillings. What was 
the price of each bird in an exact number of shillings? 


2. Find necessary and sufficient conditions that the equations 
ax + by +cz =d, 
a2x+by+ez=d' 

be solvable in integers. 


3. Along the lines of the second method of solution of a single linear equa- 
tion in three unknowns, develop one for the solution of two linear equa- 
tions in three unknowns. 

4. Find all the integer solutions of 2? — 7? = 15. 

5. Show that if (71,y1,21) and (22,y2,22) are solutions of av + by +cz=d 
then (a1 —2%2, ¥1 —Y2, 21 — 22) is a solution of ax + by+cz=0. Find all 
the solutions of the latter and use this to show that all solutions of the 
former equation can be written in the form 


r= My τα Ἢ sy, 
yY=Yitrut sw, 
5 τ ζι + ryt sp, 


where u and »v are arbitrary integers and (r1,72,73) and (81,82,83) are solu- 
tions of ax + by + cz = 0. 


3.4, Quadratic Diophantine Equations 


Quadratic Diophantine Equations arise, for example, in such prob- 
lems as the following: Find an integer b such that the following equation 


ee ttt 
--" . 


᾿ 
" 
" 
[ 
"ἢ 
ial’ | 
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leads to a “‘factorable’’ quadratic in the sense of intermediate algebra, 
that is, to a quadratic equation whose roots are rational numbers: 
1 “ὦ 
ἜΣ ΣΧ ΣΝ 
On multiplication and collection of terms we have the equation 


τ᾽ -- 2(6+ ὃ) +b+8=0. 


If this is to have rational solutions, its discriminant must be a perfect — 


square; that is, for some integer s, 
83 τ 6° + 80 1 4 - (Ὁ - 4) -- 12. 
Thus we need to solve for ὃ the Diophantine Equation 


12 = (Ὁ + 4)? — 83, 
Hence one factor of 12 must be b+ 4 Ὁ and the otherb+4—s. In 
symbols, 
r=b+4+8, 
ἐτ- ὃ - 4 -- 8, 


where 7i= 12. Thus r+ ¢=2(b+4) and 7 -- ὁ -- 25. Because of the 


factor 2 in the right members, these equations may be solved for integers 
b and s if and only if r and ¢ are both odd or both even (we say briefly, 
“yr and ¢ are of the same parity”). Since interchanging r and ¢ does not 


change ὃ, we may choose r to be less than ¢ in absolute value and have the 


following possible pairs of values for r and t: 
r=+1t¢= +12;r=+2,t=46;r= +3, ¢= 44. 


However, r and ¢ must have the same parity, and hence the only values 
to be considered are 


r=2,t=6andr=-—2,t=—6, 
which yield 
b= 0 and b= --8, 


Only the latter value yields a quadratic equation. The same method may 
be used to prove 
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Theorem 3.4. The Diophantine Equation «ἢ — u’y* = d has the solu- 
tion 2a = r+ t, 2uy = τ — t, where rt = ἃ and r — ὁ = 0 (mod 2u). 

The proof is left to the reader, who can also see that the equation has 
no solutions if d = 2 (mod 4). 

Theorem 3.4 shows that such an equation has only a finite number 
of solutions. It may therefore seem remarkable that the equation 

α -- οὐδ ΞΞ ἃ 
has either no solution or an infinite number if ¢ is not a square. Ifd=1 
this is the so-called Pell Equation, which is discussed more fully in the next 
chapter. At this point we shall be content to show that if 2o,Yo is a solu- 
tion of 
2? —cy=1, 
with yp not zero and ¢ not a perfect square, there is an infinite number of 
solutions. To this end, let n be any positive integer and see that the ex- 
pansion of (2) — Vcyo)" determines integers ὦ and y by means of the fol- 
lowing equation: 
(to — Vey)" = 2 — Vey. 

That is, x is the sum of all terms in the expansion of the left side of the 
equation in which the exponent of V is even, and Vcy all those for which 
the exponent of V¢ is odd. If in each term of « we replace —Vc by Vc, 
we produce no change; if we make the same replacement in the other 
terms, the sum becomes +Vc-y. Hence 


(xo + V eyo)" = «+ Vey. 


Taking the product of the left sides of these two equations and setting it 
equal to the product of the right sides, we have 
(a2 — cy?)" = 2? — cy’. 

The quantity in parentheses on the left side is 1, and hence the right side 
is 1 also and z,y is a solution of the Pell Equation. Since (a + Vcyo)” 
increases with n, no pair of values of z and y for any ἢ can be the same as 
any pair for another ἢ. 

In contrast to the above result for the Pell Equation there is a re- 
markable theorem of A. Thue, which is as follows: If 


f(z) = Gna" + Gp-e™ 1 + +++ tazt a, n 2 3, 
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has integer coefficients and is irreducible, that is, has no polynomial fac- 
tors with integer coefficients, then the equation 


OnE” + Ane" ty + +++ + ayry™ + ay” = ὃ 


has only a finite number of solutions in integers x and y, zero being finite, 
(Ref. 10, p. 260 ff.) 


Exercises 3.4 


1. For what integers b does 
1 b 
=i ae 
yield a quadratic equation with rational roots? 
2. Prove Theorem 3.4. 
3. Show that if 2? — cy? = 1 has a solution with y ¥ 0 and z?— cy τ ἃ 


has a solution, then the latter equation has an infinite number of solutions 
if c is not a square. 


3.5. Pythagorean numbers 


There is a particular kind of quadratic Diophantine Equation which 
is connected with the right triangle. It is well known that there is a right 


triangle with sides 3,4, and 5. This collection of numbers is called a 


Pythagorean set. To find all right triangles whose sides are integers we 
must solve the equation 


att γ᾽ = at, 


First, notice that if z,y,z is a solution, then bz,by,bz is also a solu- 
tion; and, conversely, if any two of z,y, and z have a prime factor in com- 
mon, it must divide the third, and, after division, we get another solution. 


Hence we may assume that no two have a factor in common. Such solu- 
tions we call primitive solutions; all solutions will then be multiples οἵ 
these. Furthermore, if x and y were both odd, 22 would be congruent to 


2 (mod 4), which is impossible. They cannot both be even, since we have 
ruled out the case when they have a factor in common. Then suppose 
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we let y be even and equal to 2u. Thus we have reduced our problem to 
finding solutions of 


x? + 4u? = 2°, 


with x odd and no two of x,u,z having a factor in common. 
The equation may then be written in the form 


2 — «ἢ = 4y?; 
that is, 
2—-2=2r,2+ 2 = 23,7rs = wu’, 


since x and z are of like parity, which implies z = r+s,2=s—r. Now, 
if r and s had a prime factor in common, it would divide z and x. How- 
ever, 2 and x are relatively prime, which implies that r and 8 are also. 
Thus 7s = uv? implies that r and s are squares which may be written 
r=m’,s=n', with mn =u. This gives us 


z2=m+n?,x = n? — m’, y = 2mn 


as the complete solution of z?+ y? = 2. That is, for each pair of values 
of m and n we have a solution, and if x,y,z is a solution in which no two 
have a factor in common, there are integers m and n satisfying the above 
equations. 

It is not hard to show that the Diophantine Equation x’ + y* = 2° 
has no solutions in positive integers x,y,z (Ref. 14, pp. 408 ff.). In fact, 
it is conjectured that 


a” + yy" = 2" 


has no solution in positive integers if n is greater than 2. This conjecture 
is called Fermat’s Last Theorem, and, in spite of the efforts of many 
mathematicians, has never been proved for all integers n. 


3.6. Other Diophantine Equations 


While there are systematic methods for solving many Diophantine 
Equations, many solutions depend on the ingenuity and observation of | 
the solver. To show how such solutions may be effected, we consider the 
following problem: 


74 _Diophantine Equations , 

For the fraction 16/64 one might erroneously cancel the 6’s and _ 
arrive at 1/4, a correct result. If ab stands for the two-digit number 
10a + ὃ, and be for 10b + ο, for what fractions ab/bc is the quotient equal 
to a/c where the numerator is a two-digit number with digits a and ὃ, the 
denominator is a two-digit number with the digits b and c, and b> 0? 

To solve this problem we need to find positive integers less than 10 
such that 


10a +b _ a. 
10b +c ο’ 
that is, 10ac + be = 10ab + ca, or 
(1) 10a(c — ὃ) = c(a — ὃ). 


Now, 10 is a divisor of the right side, and it is not a divisor of either c nor 
a — ὃ, since a,b,c are positive integers less than 10. Hence either c is even 
and a — b is divisible by 5, or a — ὃ is even and c is divisible by 5. We 
consider these two cases separately. 

1. a— ὃ =0 (mod 5) and c = 2c’ for an integer c’. Thena—b=0 
or +5. In the first case (1) shows that c = ὃ, and our fraction is aa/aa. 1 
the second (1) reduces to 


(ὃ + 5)(2c’ — b) = 4c’ 


The left side is even, since the first or second factor is even according as 
ὃ is odd or even. Since c’ must then be not greater than 4 and since with 
the plus sign the left side would be greater than 4, the ambiguous sign 
must be negative. Then a = ὃ — 5> 0, and we have 


(ὃ — 5)(2c’ — ὃ) = —c’, b> 5, 


which becomes δ᾽ — b(5 + 2c’) + 9c’ =0. Thus, if c =2,b= 6, a= 1; 
ifc' =4,b= 9, a = 4, and we have the fractions 


16/64 , 49/98. 
2. a—b=2u,c=5. Then equation Ἢ gives 
2a(5 — b) Ξ α -- 


Since a divides the left side, it must divide the right side, and hence it is a 
divisor of b. Write b = ka, and substitute in the last equation to get 


9 = k(2a — 1), 
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which shows that k divides 9 and hence is 1, 3 or 9. In the respective 
cases we get the fractions 


55/55, 26/65, 19/95. 
Thus the only solutions of our problem are 


aa/aa, 16/64, 26/65, 19/95, 49/98. 


Exercises 3.6 


1. A mistake often made by beginners is to write 

a/b + c/d = (a+c)/(b+d). 
Assume ὃ and d are positive integers and prime respectively to a and c. 
Find all solutions in integers of this equation. 
2. The number 3025 has the property that if the numbers 30 and 25, 
formed from the digits in order of the two halves of the number, are added 
(giving 55) and the result squared (55) the result is the original number. 
How many other numbers are there composed of four different digits 
having the same property? 
3. Show that if a/b « c/d with ὃ and ἃ zit integers then 


ate 
ἼΩ ΣΤ be i 


Will the same result hold if we remove the restriction that the letters are 
integers? 
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4.1. The Fibonacci Sequence 


There is a very interesting sequence of integers which bears the name of 
Fibonacci, an Italian mathematician of the twelfth century. The first — 


few numbers of this sequence are 
1, 1, 2, 3, 5, 8, 18, 21, 34, .... 


Each number is the sum of the two preceding ones. These numbers occur 
in nature in the following manner. If a twig or branch is selected, and, — 


starting with one bud, one’s hand moves to the nearest bud and so con- 


tinues in spiral fashion around the twig until it reaches a bud just above — 
the starting one, the number of intervening buds will be one of the 


Fibonacci numbers, and the number of revolutions around the stalk will 
be another such number, which ones depending on the particular plant. 
A pine cone gives a similar result. 

We are concerned here, however, with the mathematical properties 
of the sequence. Notice that 1-3 — 2? = —1,2-5 — 3? = 1.3.8 -- 5% = -¥ 


5-13 — 8 = 1, and soon. This property is connected with a geometrical — 


deception which may be familiar to the student. A square 8 units by 8 
may be cut up as shown in the figure below and the pieces fitted together 
as shown to give a rectangle 5 by 13. The area is thus increased by 1 
square unit. Notice that the numbers appearing are those of four suc- 
cessive numbers of the Fibonacci Sequence. 

The property noted above is equivalent to the sequence of equations 


1/2 — 2/3 = —1/6, 2/3 — 3/5 = 1/15, 3/5 — 5/8 = —1/40, ete. 
76 
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This means that if the quotients of successive terms are formed, their 
differences are alternately negative and positive and each is less in ab- 
solute value than the preceding. In other words, if we call uv, the nth 
term of the Fibonacci Sequence where 1 and 1 are the first two terms, the 


ratios U,/Un+1 for n > Ὁ form a decreasing sequence for odd values of n 
and an increasing sequence for even values of n, each of the former ratios 
being greater than each of the latter. (This will be rigorously shown later.) 
The ratio t¢n/Un41 approaches a limiting value which we shall call k, as- 
sociated with what the Greeks called the Golden Section. The rectangle 
the ratio of whose sides is 1/k was supposed to be the most beautiful. 

There is an interesting trick by which one can give a closed expression 
for the nth term of the Fibonacci Sequence and many other sequences 
similarly determined. First of all, consider the Fibonacci Sequence and 
notice that the nth term is given in terms of the two preceding ones by 
means of the formula 


(1) Un = Un—t + Un—s. 
The trick is to set 
(2) Un = ca” + dp” 


and show that c,d,a, and 6 may be determined so that wu: = 1, τὲς = 1 and 
(1) holds. It will then follow that (2) gives a formula for the terms of the 
Fibonacci Sequence. 

First, we find how a and 8 can be chosen so that (1) will be satisfied. — 
If (1) is to hold, the following equation must be true: 


ca" + dB" = ca + dp + ραν 3 + der, 
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The coefficients of c and d in this equation are 

a" (a? — a — 1) and 6 *(6* — 6 — 1), 
respectively. Hence the last equation will be satisfied if a and β are the 
roots of the equation 
(3) 2’—z-1=0. 


Yor this choice (1) will hold whatever the values of ὁ and d. 
Second, we choose ὁ and ὦ so that wu: = 1 and wz = 1; that is, 


1=ca + dB, 
1 = ca*® + dp’. 


Since a γέ β these equations can be solved for c and d with the following 
result: | 
Ml id SE μι te 

° = aB— a)’ * ~ BE =a) 


Then, if we denote by a the root (1 — +/5)/2 of (8) and β the other root, 


we see that 8 — a is equal to ν΄ ὅ, and a little manipulation shows that — 
6- —1//5, ἃ = 1/V5. 


Hence 


(4) Un = (β" — a®)/V5. 


Exercises 4.1] 


1. Using the numbers 5,8,13,21, dissect the square 13 inches on a side in 
a fashion similar to that in the figure above. | 


2. Using the formula (1) for u, of the Fibonacci Sequence given above, 
show that 

UnUnge — U4, = (—1)"™. 
Hiv: Show that the left side is equal to —(Un-1n41 — u2). Achieve the 
same result using formula (4). 


3. Using previous results, find lim u,.1/u, for the Fibonacci Sequence. — 
' {πα} 


4. Consider the sequence 
1,1, 3,7, 17,+°:, 
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in which uv; = 1, v2 = 1, and up, = 2tn_1+ Uns. Using the methods used 
for the Fibonacci Sequence, find a formula for the nth term. 
5. If the equation zx? — ax — ὃ = 0 has two distinct roots, a and 8, where 
8 > a, and if the sequence of integers u, is defined by its first two terms 
and the equation 
Un = GUn—1 + buns, 
show that a formula for wu, is 
Un = (Bur — 12) αὐ ἢ ᾳ + (U2 -- at)B""/g, 
where g = Va? + 4b. 
6. If in the sequence defined in the previous exercise [αἱ >|8| show that 
Un/Un—1 approaches a as n becomes infinite. Is there a limit if a = —f? 
If so, find it; if not, show why not. 
7. If in Exercise 5, a = 8 show that the nth term of the sequence is given 
by the formula 
Un = (αὐ + nda, 
where a is the root of «ἢ -- αὐ -- ὃ Ξε: Ο, c= 4(aui — us)/a® and d= 
2(2u2 — au;)/a*. Note that a=a/2. In this case will u,/u,-1 ap- 
proach a limit as n becomes infinite? If so, find it; if not, show why not. 
8. Suppose we have two sequences whose nth terms are p, and gp, re- 
spectively, where Dna = Pn—1 + Pn—2; Gn = Qn-1 + Gn—2 and pi = 1, po = 2, 
while φι = 4,g.=7. Prove 


Pndn—i ~ ΠῚ »--ἰ = (—1)*. 


4.2. Properties of Fibonacci-like Sequences 


As we found from the exercises above, there are many sequences 
having many of the properties of the Fibonacci Sequence. In fact, we 
may combine the types mentioned in Exercises 5 and 6 above and con- 
sider the pair of sequences p;,9., Where 


(1) Pi = αχρι-α + Pr—2, Ue = ἀκῃκ.--α + Qe—2. 
Now 
Pe Pr~1 QkPr-i1 t+ Pr~2 Pr—-1| |De-2 De—1 
PrQe—1 — QePe—1 = on > ᾿ 
Ge Qk-1 OnGe-1 + 4..." 6... Qk-2 = Qk—1 
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The second equality follows from (1), and the third is obtained by sub- 
tracting a, times the second column from the first, an operation which we 
know leaves unaltered the value of the determinant. Thus, noticing that 
the last determinant is equal to 


= (Pr—19e—2 » Qk~1Pk—2), 


we see that if in the expression p.q.-1 — Qxpx—-1 We decrease the subscripts 
by 1 we merely change its sign. Hence, if we decrease the subscripts by 
k — 2, we change the sign k — 2 times; k diminished by k — 2 is 2, and 
we have 


(2) (ρκαι-ι — QuPr—1) = (—1)* (ρ:ᾳι — Gap) = (—1)*(pagi — Gupr). 
Thus we have proved 


Theorem 4.2a. If p, and q are the kth terms of two sequences de- 
fined by the equations (1) and if peoqi — gop: = 1, then 


PrQe—1 — QePr—1 = (—1)*. 
An immediate consequence of the theorem is 
(3) Pr/ Ge — Pr—1/Ge—1 = (—1)*/(Gude-r) 3 


that is, two consecutive quotients differ in absolute value by 1/q:q.—1, and 
the differences alternate in sign. Furthermore, 


Dr Pr—2] |QePr—1 + De—2 Pim? 
Prak—2 τσ QkPr—2 = "Ἢ 
4, Qk—2} ἰακᾳικ.-α Ὁ 4ι-- 4... 
Pk—1 Pk—2 
= Oy = a,(—1)*", 
Tk—-1 Vk—2 


where the third equality results from subtracting the second column of 
the determinant from the first. Thus we have 


Theorem 4.2b. If p, and q, satisfy the conditions of Theorem 4.2a 
and a, is positive, then the fractions p,/q, form a decreasing sequence for 
k even and an increasing sequence for k odd. 


Let us see now what we have just proved in terms of the Fibonacci — 
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Sequence with which we started. In equations (1) let a,=1 for every k, 
and choose 

r= 1, Po= 1, a= 1. go = 0. 
Then we will have the following table of values for p, and αι: 


k 4 Ὁ 5.4 6:5. chito. fie 
Ue O114235685.Bn an δ ἂν 
1123 5 8 18 21 34 ὅδ 8) 144-:--. 


Thus the p, and gq, range over the Fibonacci Sequence with px = qe+1. 
Furthermore, if « is the kth term of the Fibonacci Sequence, we have 
Ux = Ue = Pk—1- 
Hence equation (3) becomes 
Uns 1/'Une — Une/‘Un—1 = (— 1)*/ (Uxte—1). 
Theorem 4.2a states (notice that for the choice above p2g1 — 92p1 = 1) 
Uns Uk—-1 — UW, = (— ἡ» 

and Theorem 4.2b states that uz..1/u form a decreasing sequence for k 
even and an increasing sequence for k odd. These are the results which 
we found empirically and have now established. 

In an exercise it was shown that the ratios u.41/u, approach a limit. 
We are now in a position to show that under the conditions of Theorem 
4.20 the ratios p,/q@ approach a limit. By “approaching a limit” we 
mean that there is some number L so that, by choosing k large enough, 
we can make 

|L — Ῥω αι 

as small as we please. We have seen that the quotients p:,/q form a de- 
creasing sequence for k even, and an increasing sequence for k odd. Fur- 
thermore, each even-numbered ratio is greater than every odd-numbered 
ratio, since if r is greater than k 


Dox/ ox > Prr/ Gar > Por—1/Q2r—1- 


However, these properties alone do not guarantee the existence of a limit, 
as is shown by the following sequence: 


—1, 4, 0, 3+1/2,1—1/2,3+1/4,1—1/4,3+1/8,1—1/8,---. 
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Here the even-numbered terms (second, fourth, and so on) form a decreas 
ing sequence, the odd-numbered terms form an increasing sequence 
each of the former is greater than all of the latter. However, the bare of 
the sequence do not approach a limit since the evee-susahored terms ¢ ᾿ 
proach 3 and the odd-numbered terms approach 1. The difference ne | 
tween. successive terms approaches 3 — 1, which is 2. For the existence 
of a limit we must prove that the difference between successive 
approaches zero. This is easily done for the ratios p,/qs; for, from (3), the 
differences between successive quotients of the sequences (1) are i ab- 
solute value equal to the following: 


(4) 1/(Q2q1), 1/(qsq2), 1/ (gags), «++ , 


where the products in the denominators form an increasing sequence of 

positive integers. Hence the fractions in (4) certainly decrease more 

ning than 1/2, 1/3, 1/4, --+ , which approach zero. 
uppose, then, we define a regular pair of sequences to be 

uences 

px and q, defined by the relationship (1) with a, positive iene and 

P21 — φῆρι = 1. Then we may summarize the results so far obtained in 


Theorem 4,2c. If p, and q are the kth terms of regular pair ool 

-" 8 οὗ q 

quences, they satisfy the conditions of Theorem 4.2a, and the quotient 
px/q. approaches a limiting value as k increases without bound. . 


4.3. Simple continued fractions 


Regular pairs of sequences are connected with so-called continued! 


fractions, which have many uses and interesting i 
ions, | roperties. i 
fraction is a number of the form ΤῊ ΔΙ | 


want no αι except perhaps the first is zero. If the b,’s are all 1 and the — 
a,’s Positive integers, except that the first may be zero, we call it a simple 
continued fraction, and it is only this kind of continued fraction that is to 
be considered here. From now on we use the term “continued fraction” 
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instead of “simple continued fraction.” Such fractions may be written 
much more compactly in the form 


j= (a1,@2, ** *), 


since, knowing the a,’s, we can easily write the longer form. Such a frac- 
tion may or may not have a last term. In the latter case we call it an in- 
finite continued fraction. 

If fis the continued fraction (a1,d2, - ++), we call a its first convergent, 
(a3,@) its second convergent, --- , (a1,a2, --*,@x) its kth convergent. 
The kth convergent may be written in the form pi/q:, where px and g, are 
relatively prime positive integers. 

The simplest continued fraction in a certain sense would be that in 
which the a’s are all equal to 1. Its successive convergents are 


(1) 1, 2, 3/2, 5/3, 8/5, -+*; 


which are the reciprocals of the quotients of successive numbers of the 
Fibonacci Sequence. We know from the previous section (why?) that 
these quotients converge to a limit and that hence the value of the con- 
tinued fraction approaches a limit. So we may let 


1 
I 
ΩΣ εἰ Ὁ 


z=1+ 


Since this continued fraction has the same value if we start with the 
second term as if we started with the first, we have 


z=1+1/z2, 


or z2-2x—1-=0. Use of the formula for the solution of a quadratic 
equation yields 


po ἘΣ 9. 161034 ---, 


where the positive sign is chosen, since z is positive, and 


Vo τος 5-1 = 618084 ---. 


1/z = 
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1. Find the limit approached by the quotients of successive terms of the 
sequence of Exercise 4 in Section 4.1. What would be the limit for 
Un = AUn—1 + τ... οἵ 
2. Find the limiting value of the periodic continued fraction 
(1, 2, 1, 2, 1, 2,°°*). 
For this continued fraction express p, and g, in terms of p,-1, p,—2 and 
Gx~1, We—2, respectively. 


3. Prove that piges— qxpr_s = (ακακ.« + 1)(pesqu—s — Ge—«pe—s), Where 
px/d is the kth convergent of a continued fraction. 

4. What is wrong with the following “proof” that 1+3+9+---=—1/2? 
Letz=1+3+3?+--- =14+31+3+4+3?+---). Hencez = 1+ 31, 
which gives = —1/2. What is the essential difference that makes this 
proof invalid while the similar proof above is valid? Does this method 
of proof suffice to evaluate 1 + 1/2+1/4+1/8---? 


4.4. Computation of the convergents of a continued fraction 


Much of the labor of computation of convergents of a continued 
fraction may be avoided by establishing formulas for p, and q. Com- 
putation shows that for the simple continued fraction (a;,a2, «++ ,a,) the 
first three convergents are 


αι, (ανας + 1)/a2, [as(aia2 + 1) + a1 ]/(asa2 + 1). 
From this it may be seen that 
Ps = ἀφ. -ἴ Pr, 48. = Asg2 + 6ι, 


which is exactly the relationship (1) in Section 4.2 for k = 3. That this 
holds for all & is the statement of 


Theorem 4.4a. If p,/q, is the kth convergent of the continued fraction 
(α:,α:, ---), where p, and q are positive, relatively prime integers, then 


Pk = UxPr—1 + Pr—2, Ge = AxQu—1 + Qe—2. 
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We have already shown this for k = 3. Let us assume that it holds 
for k = r and show that this implies that it holds for k = r + 1; this will 
then prove the theorem by induction. Now 


p,/ qr = (41,02, Soni sr), 


and thus 741/941 Will be obtained from this by replacing a, by a, + 1/a,+1. 
If we make this replacement in p,/q, = (α,»,..« + Py—2)/(Grqr—1 + 4,...) We 
get 

Prt — Grti(GrPra + Pra) + Pra ἀτμρ, + Pes, 

Geet ἀρμία,,,-« + Gre) + Gra ἀνῇ, 1 Ora 
Since the numerator and denominator of the fraction on the right are 
positive integers, to establish our result we need only to show that a,+.1p, + 
Pr—-1 aNd 419, + g,-1 are relatively prime. However, 


GrpiPr + Pra P 
αν μια, + Qr-1 q 


by Theorem 4.2a, and any common factor of the elements of the first 
column of the determinant would have to divide 1; this shows that p,41 
and q,41 are relatively prime and completes the proof of the theorem. 

Since these relationships are the same as (1) in Section 4.2 and 
Poq1 — Qop1 = 1, the sequences p;,g, are regular sequences and we can 
conclude from Theorems 4.2a and 4.26 the truth of 


Theorem 4.4b. If p:/q: is the kth convergent of a continued fraction, 


Ῥκᾶκ--α το UPr-1 = (—1)*, 
and the continued fraction converges. The conditions of Theorem 4.2b 
also hold. 

There is a systematic way to compute the convergents of a given con- 
tinued fraction. To do this, notice that pi = αι, p2 = aia2+1 would 
permit us with consistency to define p = 1 and p_;=0. In a similar 
fashion, we would have q = Ὁ and 4.1 Ξ 1. Then suppose, for example, 
αι = 1, a2 = 2, ας = 3, --- , and we form the following table: 

αι L226. 8 6 


ρει 0 11 8 10 43 225 1393 
qei0 127 8. 87 97, 


where the number 3 in the second line is 2-1 + 1, 10 is 3-3+1, and so 
forth, the third line being similarly computed. 


= ιν — Grapr = +1, 
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1. Find the first five convergents of each of the following continued 
fractions, the second and third being periodic (see Section 4.7): 

a. (1,3,5,7, -°-). 

b. (1,2,2,2, +++). 

6. (1,1,2,1,2,1,2, ---). 
2. Show that the continued fraction of Example b above converges to 
4/2 and that of Example c to ν ὃ. 


3. Find the continued-fraction expansion of 355/113, and compute the 
first three convergents. 


4. Find the continued-fraction expansion of ν δ, compute the first: three 


convergents, and compare your values with +/5 = 2.236068. 
5. If px/qz = (@1,@2, "" "γακ), Show that a; is the greatest integer less than 
px/pe-1 and hence, from the result for all k, that 

Dr/ Pr-1 = (α,,α;.-"; ΠΩΣ 02,01). 
6. If q = pe-1, What can you conclude about the continued-fraction ex- 
pansion of ρκ ᾳεῖ 
7. Using the notations of Exercise 5, prove that px_1 = gq implies 


g@ + (—1)* = 0 (mod p). 


Conversely, prove that the congruence and p, > q, imply px-1 = de. 
Hence state a necessary and sufficient condition that a finite continued — 


fraction read the same forward as backward. 


4.5. Continved-fraction expansions 


We have so far been chiefly concerned in finding the numbers to | 


which continued fractions converge. Actually, from a computational 
point of view, it is of more importance to seek continued-fraction ex- 


pansions of given numbers. This is useful, since, as we have seen, the — 


various convergents give approximations to the number. For instance, 
one of the convergents of 7 is 22/7, the familiar approximation, and no 
fraction with denominator smaller than 7 is a better approximation. 
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It is apparent from the form of the continued-fraction expansion that 
if x is the continued fraction (a:,a2, --+) then a; is the greatest integer in 
az; that is, the largest integer not greater than z. We shall now illustrate 
the method of expansion, by three examples. 

Exampts 1. Find the continued-fraction expansion of 573/227. 
Now, 573/227 = 2+ 1/a2, where 22 = 227/119 = 1+ 1/x3, where x; = 
119/108 = 1+ 1/as, where 2, = 108/11 = 9+ 1/25, where 25 = 11/9 = 
1+ 1/xs, where x = 9/2=4+1/2. Using the sequence of a;’s ob- 
tained in this manner, we have 


each Gul ae 
012 3 5 48 53 260 573 
10112 19 21 103 227. 


Notice that we might also have chosen a; = 1, ας = 1 in place of a, = 2, 
in which case the next to the last convergent would have been 313/124. 

Exampte 2. Expand 1/17. Now νΊ7 =4+1/2, where 2 = 
1/(f1l7 — 4) = V17+4=8+1/22 Since z.= σι, the expansion of 
V/17 is (4,8,8, ---). To six places the value of ν 7 is 4.123106. We list 
below the first four convergents, their decimal values to six places, and 
their discrepancies from the true value. 


Convergent 4 33/8 268/65  2177/528 
Decimal value 4 4.125000 4.123077 4.123106 
Discrepancy  —.123106 +0.001894 —0.000029 -+-0.000000 


Actually, since we know that the true value of the number lies between 
any two successive convergents, we can tell from the expansion that 
4.1231 is accurate to four places and 4.123106 to six. Notice that the 
continued fraction is periodic. We shall prove later that this is true of the 
continued-fraction expansion of any real quadratic surd (real root of a 
quadratic equation with integer coefficients). 

Exampte 3. Find the expansion +/2. This could be done some- 
what along the lines of the above example, but there is a simpler way — 
a method which can be applied to get a positive solution of any poly- 
nomial equation. We seek, then, the real root of z?— 2=0. Since the 
root is between 1 and 2, we have x = 1+ 1/2, where 2; is greater than 1. 
One may find the equation in κι by substitution in the given equation, 
but it is simpler to diminish the roots by 1 and reverse the order of the 


88 Continued Fractions 


coefficients. Since the process usually employed in the theory of equa- 
tions may be unfamiliar to the reader, we illustrate the mechanical process 
referring him to any college algebra text for the reasoning involved. 


ἘΦ τἶ ἢ «ὃ. τὰ Thus, if z = 1+ y, y satisfies the 


Lat 1 ” equation ψὲ + 3y? + 3y -- 1 =0. 
ba 7 -1 Then y = 1/2; implies 
poe * 1 + 821 + 322 — 2? = 0, or 
. ae 
q zt — 322 — 8m — 1 = 0. 
δ᾽ Ὁ 


Notice that the last equation is obtained by reversing the order of co- 
efficients and changing the signs. Further computation yields 


a1 = 3 + 1/22, where 10χὲ — 6223 — 62, — 1 = 0, 
- 22 = 1+ 1/23, where 8.58 — 1222 — 242, — 10 = 0, 
ὡς = 5+ 1/24, where 1 < 2% < 2. 


The computation of convergents and accuracy of results is as follows: 


ακ Ὶ 3 1 5 1 

οἶα a μ᾿ 4 5 29 84 

ἀν ἅν δ... ἃ 3 4 23 27 
Ῥκίᾳι 1.000 1.3333 1.2500 1.2609 1.2592 


Discrepancy —0.2599 +.0734 ἢ —.0099 +.0010 -—.0007. 


Notice that the larger the a; the greater the gain in accuracy from one 
convergent to the next. 
With the above examples in mind it is not hard to prove 


Theorem 4.5a. The continued-fraction expansion of any real positive 
number is unique except that every rational number has two expansions: 


(αι, G2,°**, Ant, Gn), (ai, G2," **, Gn—1, An, 1), 


where a, = a, + 1 and the first n — 1 of the a’s are the same. 

To prove this, let x; be defined as that part of the expansion beginning 
with ας, and 7441 that beginning with a:,:. Then 2; = a; + 1/2i41, where 
δὲρὶ S » Tf χω» 1, this shows that a; is the greatest integer in 2; and 
hence is uniquely determined. However, if x:,; = 1, the continued fac- 
tion may terminate with (ας ἘΞ 1) or with a;+ 1/1. A terminating con- 
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tinued fraction, of course, represents a rational number. See Example 1 
of this section. 

Notice in this example that p./gs = 260/103 and a solution of 5732 — 
227y = —lisx = 45, y = ps. The next to the last convergent of the alter- 
native expansion, 313/124, similarly gives a solution of 5732 — 227y = 1. 
Why is this so? The following is left to the reader to prove. 


Theorem 4.5b. If a/b is a positive rational number in lowest terms 
and if a/b is expanded into a continued fraction with k terms, then 
Qx-1, Pk-1 18 ἃ solution of ax — by = (—1)*. 

This theorem gives a means of solving Diophantine Equations, linear 
in two unknowns, but it is not as good computationally as methods given 
previously in this book. 


Exercises 4.5 


1. Prove Theorem 4.5b above. 

2. Using continued fractions, find a solution in integers of 18% — 23y = 1. 
3. Find the first five convergents in the continued-fraction expansion of 
3.14159 and of 3.14160. Since π lies between these two numbers, what 
does this show about the continued-fraction expansion of 7? 

4. Find the continued-fraction expansion of Va? + 1, Va? + 2 for posi- 
tive integers a. 

5. Approximate by continued fractions accurate to three decimal places 
the positive root of χὅ — 713 -- 1=0. 


4.6. Closeness of approximation 


The convergent 355/113 in the expansion of 7 was used as an approx- 
imation for this constant by the Babylonians. No fraction with smaller 
denominator gives a better approximation. Similarly, 22/7 is the best 
approximation for the size of its denominator. In fact, 


Theorem 4.6a. If p/q is a convergent of the continued-fraction ex- 
pansion for a real number 2, there is no rational number a/b with ὃ S q 
which is closer to 2. 

The proof of this theorem will follow immediately from the 
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Lemma: If a/b <c/d « α' [δ΄ and ab’ — αὖ = ~1, then d is greater 
than ὃ and δ΄. (All letters represent positive integers.) 

First, let us see why the theorem will follow from the lemma. Sup- 
pose px/qz is the kth convergent in the expansion of x and a/b is closer to 
v than is p./q:. Then a/b is between the kth and k — Ist convergent, 
since it is then either between x and p;/q, or between 2) and Pr—1/Qe-1. 
However, pi/qx and prs/qr-1 satisfy the conditions of the lemma, and 
hence 6 is greater than αι. 

To prove the lemma, see that its conditions imply that 

0 <c/d—a/b <a’/b’ — a/b, 
0 < (cb — ad)/db < (a’b — ab’)/b’b = 1/bb’. 
Hence δ΄ (οὗ -- ad) «ἃ. However, οὖ —ad is a positive integer, and 
therefore b’ <d. Similarly, it may be shown that ὃ « ἃ. 
It should be noticed that, though we know from the above theorem 


that if p./q, and pis/q.-1 are two successive convergents of 2, there is 
then no rational number closer to x than p;/q, whose denominator does _ 


not exceed ας; yet there may be rational numbers between the two con- 


vergents which are closer to 2» than p,_1/q,-1 and whose denominators _ 


are between q, and q.1. Suppose k is odd. Since z = q, y = pr is one 
solution of pryit — gerry = 1, as we saw in the previous chapter, all the 
solutions are of the form x = gy + tqu41, y = De + ps1 for integers t. 


Then, if we form the sequence of values of the ratio z/y for t= 


0,1,2, +++, G42, we get the sequence of fractions 

Pk De + Pat De_+ ὥρια. , Pet (ἀν -- 1) Pet Diets 

Ge Ge +e de + Wey Ge + (Gers — Ugerr {μεν 
It follows that αὶ odd implies that p,/q, is less than x, and any rational 
a/b between p;/q, and x» must satisfy just one of the following three con- 


ditions: First, be one of the fractions in the sequence above; second, lie — 


between one fraction and its successor; or, third, lie between the last 
fraction and 2. In the third case, its denominator must be greater than 
Ga+2, Since it lies between pii2/ge+2 and prs1/ge41. In the second case, its 
denominator must be greater than the denominator of the largest fraction 
in the above set to the left of which it lies. We call the sequence above 
the intermediate convergents. Notice that, if we consider the numerator 
the y-coordinate of a point and the denominator the z-coordinate, all 
these fractions represent points on the line pit — φμειὺ = 1. 
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To obtain limits for the closeness of approximation of a convergent 
to the number, x, which the continued fraction represents, let Lit be the 
portion of the expansion of 2» beginning with αν. Then, if az: 18 re- 
placed by +1 in 

Dept _ GeviPe τρια, 
Giz ke TF Ga 


we will have an expression for x. From this it readily follows that 


sa, Ei να ae ha 
δ ας ᾳιίσμμαι + 4...) 


Furthermore, 2n419e + 4ι-ιῈ Ge+iQe + Ge-1 = Qe+1 > Ge, the inequality 
holding in the first instance unless ὦ is a rational number whose expansion 


terminates with a,4:. Furthermore, 
Qe—1 = Veet τ ἀμειθὰ 
implies 
Tepe + Wei = (Teer — Geyr)e + err < ge + eta < 26. 
Combining all these results, we have 


Theorem 4.6b. If p:/q, are convergents of the continued-fraction 
expansion of z, then 


1 1 μι ΔΗ; Ἢ 
es ees SM 1S Ξ 
2qegur1 φείᾳι + Gers) αἴ φῦ «ἢ 
It is not hard to see that the equalities cannot hold if z is irrational, but 
there are values of z making the following differences arbitrarily small: 


Sosy ley (Gd, . ἢ θα. Ea Ia 
ᾳφκάκει ᾳ \ Gel Me Qe + Geta) 


Furthermore, the larger the q:, the closer the approximation. 


Exercises 4.6 


1. In the manner described above, plot on a graph the first three con- 
vergents of ν 17 and the intermediate convergents. 

2. Similarly, plot the first three convergents and the intermediate con- 
vergents of x. 
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3. Fill in the details of the proof of Theorem 4.6b. 


4. Find values of x and k for which one of the differences at the end of 
Section 4.6 is less than .01. Would there be a value of x for which both 
differences would be less than .01 for all & beyond a certain value? 


5. From Theorem 4.6b how close must the third convergents in Exercises 
1 and 2 be to the true value? 


6. Prove that (τ — ρει) αι) (ὦ; — pe_s/Qe—1) = —Qe-1/ e041. 
7. Using continued fractions, find ~/4 accurate to three decimal places. 


8. If a/b and c/d are two fractions in lowest terms for which a,b,c,d are 
positive integers and be — ad > 1, show that there are two positive in- 
tegers r and 8 with s < ἃ and cs — rd = 1. Show that this implies 


a/b <r/s <c/d. 


9. The so-called Farey Sequence of order 5 is the set of rational numbers 
in order which are between 0 and 1 in value and whose denominators are 
not greater than 5, that is, 


0, 1/5, 1/4, 1/3, 2/5, 1/2, 3/5, 2/3, 3/4, 4/5, 1. 


Show that if a/b and c/d are any two successive fractions of a Farey Se- 
quence of any order then cb — ad = 1. In this connection the results of 
Exercise 8 above are useful. (Ref. 6, pp. 23 ff.) 


4.7. Expansions of quadratic surds 


We saw in Chapter II that every rational number has a periodic 
decimal expansion; that is, from a certain point on, the same sequence of 
digits recurs again and again without ceasing. This includes the case 
when the expansion ends with a succession of zeros. Similarly a periodic 
continued fraction is defined to be one in which, from a certain point on, 
there is a sequence of values of the a; which repeats again and again with- 
out ceasing. This definition excludes finite continued fractions, which 
represent rational numbers. It is a remarkable property of continued 
fractions that every periodic continued fraction represents an irrational 
root of a quadratic equation with integer coefficients and, conversely, 
every irrational (and hence real) root of a quadratic equation with integer 
coefficients has a periodic continued-fraction expansion. We first have 
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Theorem 4.7a. Any periodic continued fraction represents a root 
of a quadratic equation with integer coefficients. 

Since the proof is not difficult, we give here merely an outline leaving 
the details to the student. As above, let x, denote that part of the ex- 
pansion beginning with a;. If the expansion is periodic, then for some k 
and r, the portion beginning with a, is equal to the portion beginning 
with a. This shows that, for some k and r, the following equality holds: 
t= 2, Take r <k and let p‘/q; denote the convergents of z,. Then 
we have 


Pn k — Pi, —r—2 
ini eS. Be. 


αι, χΧὅ k — Owes 


which shows that ὦ, satisfies a quadratic equation with integer coeffi- 
cients. Then x = (px—ite  Ρκ-) } (4 σι + Qe-2) shows that ᾧ likewise 
satisfies a quadratic equation with integer coefficients. 

It is somewhat harder to prove the converse of this theorem, that is 


Theorem 4.7b. If x is an irrational root of a quadratic equation, 
ax? + ber +c=0, 
where a,b, and c are integers, then its continued-fraction expansion is 
periodic. 
As above, we see that 
2 = (reper + }κ.-) }(ὥκᾷκ.-α + Qe-2)- 


To simplify our notation, replace vz, pr—1, Pe—2, {κ--ι, 4κ-- DY 2,7,8,ἔ,ιι, Te- 
spectively, and have 


(1) z= γὼ — ts| = . 


Then, by Theorem 4.6b, we have 
= r/t+ εἰ = s/u+ m/v’, εἰ « 1> [η], 
since if 2 — pr_-1/qx-1 (that is, x — r/t) is less in absolute value than 1/qj_, 


it must be 1/g?_, (that is, 1//) multiplied by a number, ε, less than 1 in 
absolute value. A similar statement is true for Ζ — pr_2/qx—2. If we sub- 


.-ὦ» om —— πσν,. ΝΣ - - Bune γ' ᾿ = 
ae ρον» i - 


94 Continued Fractions 


stitute in ax? + bz + c = 0 the value for x given in (1), we get Az? + Bz + 


C = 0, where 


A = ar? + brt + cf’, 
B = 2ars + b(ru + ts) + 2ctu, 
C = as? + bsu + cw. 


Now, if we can show that A,B, and C are bounded, that is, must be 
less in absolute value than some fixed number independent of z,, or z, and 
depending only on z, then, since they are integers, there is only a finite 
number of different equations which z can satisfy. However, the con- 
tinued fraction does not stop, since z is irrational. This fact would show 
that two such equations must be identical and hence two values of z are 
the same. As soon as a value of z recurs, the continued fraction repeats. 
Therefore the rest of the proof consists in showing that A,B, and C are 
bounded. 

Now, — 


A/é# = a(r/t)? + b(r/t) +, 
= a(x — ε΄} + δία — €/*) +, 
= ax? + ba + ¢ — 2aexr/t? + αεἢ [Δ — be/#. 
Thus, since az* + bz + c = 0, we have 
A = —2aex + ae’/? — δε, 
|A| < |2ax| + [αἱ + [δ], 
which is bounded. Using s/w in place of r/t, we see that C is bounded. 
To complete the proof, one may proceed in either of two directions. 


First, it may be shown by direct computation, in virtue of the values of 
A,B, and C given above, that 4A4C — B® = 4ac— ὃ. This is straight- 


forward but rather long. However, it does show that, once A and C are 
determined, then B is determined except for sign. 


Second, it may be shown directly, as follows, that B is bounded. By [ 


the same method as that above, the reader may easily find that 
B = —(eu/t + nt/u)(2ax + b) + 2aen/tuz ~ 


Since r/t and s/w are successive convergents of the continued fraction, 
expansion of x, €, and 7 are of opposite signs, a fact which shows ~ 


leu/t + nt/u| < |eu/t — nt/ul. 
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However, z = r/t + €/f# = s/u + n/u? implies (—ru + st)/tu = €/? — /v’, 
and hence |eu/t — yt/u| = |ru — st| = 1, which shows 


B| < |2ax + ὃ] + |2al, 
and is therefore bounded. 


Exercises 4.7 


1. Fill in the details of the proof of Theorem 4.7a. 

2. Verify by computation that 44C — B* = 4ac — δὲ in the last proof. 

3. Making use of the fact verified in Exercise 2, prove that if αὖ -- D = 0 
where D is a positive integer, the number of equations which the accom- 
panying z may satisfy is not more than 2(ῶ2 ν Ὁ + 1)* and hence that the 
repeating portion of the continued-fraction expansion of VD contains no 
more than 2(2\/D + 1)? terms. As a matter of fact, this estimate has 
been considerably improved. (The maximum is d(2d+ 1), d being the 
greatest integer in Ὁ, Ref. 15, p. 34.) 

4. Find the continued-fraction expansions of »/7, /11, ν 18. Compare 
them with those already found for +/3, ν ὅ, ν 17. Can you find any prop- 
erties common to all these expansions? 


5. Show that 
ioe ab’ + bd! 


a δ] ja’ b’ 
( 
d’ ‘+ de’ οδ' + dd’ 


noticing the element in the first row and column of the product of the 
determinants is the sum of the products of corresponding elements in the 
first row of the left determinant and first column of the right determinant. 
Similarly, the element in the first row and second column of the product is 
obtained by multiplying elements of the first row of the left determinant 
and the second column of the right determinant. Show that the correct 


values for A,B, and C satisfy 
ἌΝ; B 
b 2c} it u B 


2a b 
Use this result to show that 4ac — b? = 4AC — Β', 


T t 


8 " 


- “-Ὡ “Ὁ 
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6. Show that whenever the continued-fraction expansion for z is “pure 
periodic,” that is, if the repeating portion begins with the first term, then 
if k is the length of the period we have 


x= (pet + pe_1)/(Gt + αι... 


Hence show that if z = /D has a pure periodic continued-fraction ex- 
pansion and D is a positive rational number, then pz = qi-1, pi—1/Ge = ἢ, 
and 


pi — Dg = +1. 
7. Show that D in Exercise 6 cannot be an integer. 


4.8. Pure periodicity and reduced quadratic surds 


The remainder of this chapter will be primarily concerned with de- 
veloping a method of solution of the Pell Equation 


“ἢ -- Dy» = 1, 


where D is a positive integer, not a perfect square. Though the equation 
bears his name, Pell apparently did not contribute to its solution. Ex- 
ercise 6 above gave a preview of the kind of connection to be expected. 
However, as was shown, it does not apply for D an integer. To find a 
solution for D an integer we first need more information about the form 
of the continued-fraction expansion of ν 7). We shall first need to char- 
acterize the numbers x whose continued-fraction expansions are pure 
periodic; that is, 


t= (41,d2, eee "και, ee * Or, ee .). 
As in Exercise 6 above, we then have 
φῦ + (στο De) — Pei = O. 


Since the quadratic expression has the value —p,_; when += 0 and 
Qe + Pe — Ge-1 — Pr-1 When x = —1, the latter being positive and the 
former being negative, we see that the equation has a root between 
0 and —1. That is, the other root of the quadratic equation which 2 
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satisfies, in brief, the conjugate of z, lies between 0 and —1. We then call 
a positive root of a quadratic equation with integer coefficients a reduced 
quadratic surd if it is greater than 1 and if its conjugate lies between 0 
and —1. We have thus proved the “only if” part of the following 
theorem. 


Theorem 4.8. A number z has a pure periodic continued-fraction 
expansion if and only if it is a reduced quadratic surd. 

To complete the proof we must show that any reduced quadratic 
surd x has a pure periodic continued-fraction expansion. First, we need 
to show that if x is a reduced quadratic surd, then 2; is also, if 2 is de- 
fined by the equation x = a: + 1/22. If we let y be the conjugate of z, 
then the conjugate of x2, which we may call y2, is defined by the equation 
y = a, + 1/y:, for if f(z) = 0 is the equation for x, the equation for 22 is 
f(a, + 1/22), and f(y) = 0 shows that the equation for ψ: is f(a: + 1/y2); 
hence y2 and 22 satisfy the same quadratic equation. However, y — 1/y:= 
αι > landy «0 implies —1 < y: < 0, which shows that x is a reduced 
quadratic surd. Hence by continuing this process we see that if Ζ is a 
reduced quadratic surd, so is 2, for every k. 

Let x = (αι,α:, " -*,@r, ryt, ***, Orth) Orpkpty ***), Where the repeating 
portion is ἄμμι, ***,@r4%- Now yr = @ + 1/y,+i implies —1/yr41 = Gr — Yr, 
and, since 0 < —y, < 1, we have 


a, = [-1/ Yrril, 


the square bracket denoting the greatest integer in —1/y,4:. Similarly, 
Yarr = ἀμ + 1/Yerrsi implies ἀμ. Ξ [πα]. However, 241 τι 
Trk+1 implies yr1 = Yr+k+1, Which implies that the greatest integers in 
the negative reciprocals of the last two quantities are equal, that is, 
α, = r+. Similarly, we may show that a,1 = @,,.-1 and so on back to 
αι. This completes the proof. 

We leave to the student the proof of the 


Corollary 4.8. The continued-fraction expansion of VD, where D is 
a positive integer, not a perfect square, is of the form 


(a1, Qa, a3; ΤΆ ἄχ, κει, α, ας, ay *); 


where the repeating part is that portion from a2 to κει inclusive. 
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4.9. The Pell Equation 


We can now prove 


Theorem 4.9a. If Disa positive integer which is not a perfect square. | 


and such that the continued-fraction expansion of VD has a repeating 
portion of k terms, then p,,q, is a solution of 


(1) a — Dy? = (-1) 

To prove this notice that x. = 7,42 implies 

T= (Gap opera + De) /(Cep2Qers + Qe) = (ρει + pa) /(T2Ger1 + 4). 
However, « = /D implies that 2, = 1. (ν 7) — αι), and substitution in 
the above displayed equation gives us 

_ Pe + ρι( “Ὁ — αὐ 
Geri + (VD -- αὐ 
which results in the equation 
qeD + VD (qe+1 — Q1Qx) = Pepi + pi(VD — αὐ). 
Equating the rational and irrational parts gives the equations 
(2) e+1 — G1 — De = 9, 
Pi+1 — Axper — GD = 0. 
Eliminating a; by multiplying the first equation by —p, and the second 
by qs, and adding, gives 
—Prderi + WPerr + pj — Daj = 0, 
which results in our theorem. 

We have just shown that a properly chosen convergent of the con- 
tinued fraction expansion of ~D gives a solution of equation (1). We 
shall now show that every solution of 

az? — Dy? = +1 ; 2,y positive, 
is given by a convergent of the continued fraction for WD, More pre- 
cisely, we have 

Theorem 4.9b. If p/q is a solution of one of the equations 


xz? — Dy’ = +1, 
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with p and q relatively prime positive integers, then p/q is a convergent 
in the continued-fraction expansion of WD. 
Now, p* — Dg’ = 1 implies 


lp/a — VD| = 1/(p + gVD)q < 1/2¢, if D > 3. 
Hence, except for D = 2,3, the above theorem will be an immediate con- 


sequence of 


Theorem 4.9c. If |r/s — αἱ < 1/2s* and r and 8 are relatively prime 
positive integers, then 7/s is a convergent of the continued-fraction ex- 
pansion of z. 

To prove this, let the continued-fraction expansion of r/s with k 
odd be (a1,@2, «+ +,a,), and let 2441 be defined by the finite continued frac- 
tion 

& = (1,02, "" "γαχγῦκ 1). 
If x441 is not less than 1, we can expand it into a continued fraction and 
we will have a continued-fraction expansion of z, which, in virtue of its 
uniqueness, is the continued fraction for z. Hence we need only show that 
24.1 80 defined is not less than 1. 

Now, ὦ = (peters + pe-1)/(Qete41 + Ge-1) may be solved for 2x41, 

giving 
χει - πὸ — Pe, 
(1) i —Qit + Dr 
However, ρει = 7/s and the hypothesis of our theorem tells us that 
|x — pr/ge| < 1/2g3. 
Now, & odd implies that z is greater than p,/q:, and hence we may write 
& — pr/ Qe Ξε < 1/23. 
Solving the equation for x, substituting in (1) and simplifying, we get 
Trp = (1 — equQe1)/€9}, 


which is not less than 1 if 1 — €9,9,-. = €q?, that is, e(qj + φεσι-) S 1. 
However, ε < 1/2q? and φι-. < q implies 


e(q? + φιαι-ὐ < (G2  φιαν- ἡ) 248 < (G+ G)/2G = 1, 
which proves our result. 
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Exercises 4.9 


1. Prove Corollary 4.8. 
2. Have we shown that the least positive solution of 


χα; — Dy’ = 1, 


that is, the solution in positive integers for which y is the least, is given 
by the kth convergent in the expansion of ~/D, where k is the number of 
terms in the period? 


3. Would Theorem 4.9c necessarily hold if the inequality were replaced 
by “less than or equal to’’? 


4. Using expansions found in the previous set of exercises, find solutions 
of the equations 


| “ἢ — DY = +1 
for D = 3,5,17,7,11,13. 
5. Show that Theorem 4.9b holds also for D = 2, D = 8. 
6. Show that if D = 3 (mod 4), the equation 


xz? — Dy? = —1 
has no solution. 


7. By use of exercise 6 or other means, show that if D = 3 (mod 4) the 
number of terms in the periodic part of the expansion of ν 7) must be even. 


8. Show that if r,s is a solution of 
xz? — Dy? = +1, 
with D an integer, not a perfect square, then the integers r,, sn, defined by 
(r+ sVD)* = ra + ν 5,, 
n an integer, is a solution of 
| x? — Dy? = (+1)", 
where the ambiguous signs correspond. 
9. Use the result of Exercise 8 to find a solution of 
— 17? = 1 
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from the solution 4,1 of the corresponding equation, with —1 on the right 
side. 

10. In Exercise 8 is it necessary for any or all of 7,8,n to be positive in- 
tegers? Would the result hold if D were a perfect square? 


11. Would the result of Exercise 8 hold if D were a negative integer? If 
so, of what help would it be? 


4.10. Further information about the Pell Equation 


Though we have at hand all the machinery for dealing with the Pell 
Equation, there are two additional refinements which are of interest. 

First, we have not shown that the smallest positive solution of 
x? — Dy* = +1 (that is, the solution in positive integers for which y is the 
least, choosing +1 to be —1 if there is a solution for this choice, otherwise 
choosing ΞΕ 1 to be 1) is the kth convergent, where k is the number of terms 
in the periodic part. From a computational point of view, this informa- 
tion is not necessary, since one would use the first convergent which gave 
a solution, but it would free us from trying any until we reach the kth 
term. Hence we shall prove 


Theorem 4.10a, If the rth convergent of VD gives a solution of 
at — Dy = +41, 


then r is divisible by k, the number of terms in the period of the expansion 
of VD. 

Suppose we show that 2,;2= 22 whenever the rth convergent of 
VD gives a solution of the Pell Equation. Then, if P,Q is the least pair 
of positive integers such that P? — DQ? = +1, P/Q will be a convergent 
of the expansion of VD by Theorem 4.9b. Call it the Rth convergent, 
and have tp42= 22. Then k = R, since if k were less than 2, Theorem 
4.92 would show that the Pell Equation would have a smaller solution 
than P,Q. Then 2,2 = x2 would imply r divisible by k. It thus remains to 
show that z,;2 = 22, whenever the rth convergent of VD gives a solution 
of the Pell Equation. Notice that +1 = (—1)’, since p,/g, is less than 
VD if r is odd and greater if r is even. 


Now, VD = 2 = (pitryi + p—i)/(Grtr41 + 4,..) Shows that 
(pr —_ V Dar) χη, = —Pp-1 + VD4r-1- 
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Multiplying both sides by p,  ν 4, and making use of the fact that 
p? — Dg = (--1}" we get 


Sey = (—1)"{ 4.4... — Pepa + VD(pege—-1 — φΡ»-υ] 
= VD + (-—1)"(Dq.qr-1 — »,Ρ..), 


which implies that 
Tre = Ori + VD — αἱ, where Gp1 = (—1)"(Dargr—1 — PrPr—1) + αι. 
Now, 241 = 741 + 1/2;+2, where, from the line above, 
1/242 = VD — ay = 1/23; 
thus 22 = 2,2, which was to be proved. 
There is a certain interesting symmetry about the continued-fraction 
expansion of ~D, which is quite easy to find. The reader may have ob- 


served that the first k — 1 terms of the periodic part are symmetric about 
the middle. So we shall demonstrate 


Theorem 4.10b. The continued-fraction expansion of ν ἢ, where D 
is a positive integer, not a square, is of the form 


(a, Go, α3, U4, *** , As, ας, Ao, 2a, thie ), 


where the periodic part consists of all the terms exhibited except the first. 


From the first of equations (2) in the proof of Theorem 4.9a we have — 


Qx41/Qe = G1 + pe/ Qe. 


However, Qe+1 = Ge+19% + Qe-1 implies gey1/qe = Ghai + Qe-1/Qe (Compare | 
Exercise 5, section 4.4). Hence ay4: = [Qr41/q], since gi-1 < qe. Con- 


tinuing this process, we have 


ett 


I 
Yk SHE BE se 


1 
= Oey + n/a "ὡ 
and hence 
Gi+1/Ge = (κει, ἄχ, Ger, ***, Ge). 

However, 

αι + pr/Qe = (2a1, ας, Gs, -**, Ax), 
and the equality of the two quantities on the left shows that 

ἄμμι = 201, α: = Ay, ἂς = Apa,***. 


Thus the theorem is proved. 
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While the computation of the kth convergent of VD is not very 
difficult to accomplish, there is a quicker way to find other solutions of the 


Pell Equation, as was suggested in Exercise 8 of the previous section. 
We have 


Theorem 4.10c. If p,q is the smallest positive integral solution of the 
Pell Equation, all positive integral solutions pn,g, are given by the 
formula 


(p+ gVD)" = », + VD ns 


To prove this, suppose that r,s is a positive solution. Then r+ sWD 
will lie between two successive powers of p+qVD. That is, for some 
positive integer m, the following inequalities hold: 

(p+qVD)" <r+sVD < (pt+qvD)". 


Letting (p + VD)" = pm + φων D and dividing through by this quan- 
tity, we have 


(1) 1<ituvD <pt+qvD, 
where 
t+uVD = (r+sVD)/(pm + φων Ὁ) = (r+ 8VD) (Pm — ImWVD), 
the second equality following from Exercise 8 of Section 4.9, and 
t= TDm — 8QmD, U = Ῥ,.8 — Un". 

Replacing VD by —VD in the above, we get 

t—uVD = (r— 8sVD) (pn + Gm D). 
Multiplying corresponding sides, we have 

ὃ — Dw? = (γ" — Ds*)(p? — g2D) = 1. 

Suppose / and wu are positive. Then 
f®=1+ Dv’, p=1+D?¢ 


would show that either {> p and u>g or else t< p and u<q. In- 
equalities (1) then imply that the latter holds, which denies our supposi- 
tion that p,g is the smallest positive solution. Thus, to complete our 
proof we need only show that ¢ and τ are positive. Now, 


(2) p?— 4320 =1=r— Ds? 


— 


| 
| 
| 
: 
| 
Ι 
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implies r > sD, pm > dmVD and hence  >0. Also u has the same 
sign as (Pm — 4,7) (},ρ.8 + Gm7), Which is equal to p?s? — g?r?. However, 
(2) implies p2s* = s*+ g2s°D, gr? = αἵ + Ds’g?, and hence u has the 
same sign as 

(3) pie — git? = Φ = gm fF = p8)/D. 


However, Pm + dmWD<r+sVD implies either pn<r or 4. 48. In 
the former case, the second equality of (3) shows that τ is positive; in the 
latter case the first equality of (8) shows that τὸ is positive. 

Using this theorem, we can find all solutions of the Pell Equation by 
finding the least positive one by continued fractions and Theorem 4.10c 
to find all the rest. 


Exercises 4.10 


1. If the length of the period of the expansion of VD is odd, what will 
be the simplest way to find the least positive solution of the Pell Equation? 


2. The equation x? — Dy? = 1 represents an hyperbola. Draw a graph 
showing the relationship of the solutions of the Pell Equation to the hy- 
perbola. 


3. Show that if z? — Dy* = —1 is solvable and 7,¢ is its smallest solution, 

then all positive integral solutions p,,g, of jz? — Dy*| = 1 are given by 
(p + gVD)" = pat VDQn, 

and that p? — Dg? = (—1)". 


Nonlinear Congruences 


5.1. Introduction 


Just as in algebra we consider roots of equations f(z) = 0, here we are 
concerned with the congruence 


f(z) = 0 (mod m), 


where f(x) is a polynomial with integral coefficients. Sometimes we might 
want to know the roots and at other times we would be satisfied to know 
the number of roots. Of course, in the latter instance we mean the 
number of incongruent roots (mod m). 

We have already found that some linear congruences have no roots, 
some have one, and some have many. However, if the modulus is a prime, 
the situation is more satisfactory, since each linear congruence, where the 
coefficient of z is prime to the modulus, has exactly one solution. We 
shall find that for congruences of higher degree and prime modulus, the 
theory is also quite satisfactory. 


Exercises 5.1 


1. Show that a congruence x? = a (mod p) has two solutions or none if p 
is a prime and a # 0 (mod p). Find integers a and m for which z? = a 
(mod m) has more than two solutions. 

2. List some of the important theorems in the theory of equations, guess 


whether or not they hold for congruences, test your guesses by examples, 
105 
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and then see if you can prove them. One such theorem would be the 
remainder theorem. 


5.2. The remainder theorem 


The reader will recall that in algebra conditional equations and 
identical equations were discussed. A polynomial f(x) was defined to be 
identically equal to zero if, after terms were collected, the coefficient of 
each power of x was zero. A second definition reads: A polynomial f(z) 
is identically equal to zero if f(z) = 0 for all values of x. These two defini- 
tions are equivalent for polynomials whose coefficients are complex num- 
bers, since no such polynomial equation can have more roots than its 
degree unless all coefficients of x are zero. However, the definitions are 
not equivalent for congruence, as can be seen by the example 


a — x = 0 (mod δ), 


which is satisfied for each value of x (mod 5), though the coefficients of 
not all the powers of x are divisible by 5. In fact, 


545 — 5a = 0 (mod 25) 


has 25 roots. 

Since we have used three horizontal lines for congruence, we shall 
use four for identical congruence when we wish to emphasize identity. 
Thus we write 


f(x) S 0 (mod m) 
only if each coefficient of f(x) is divisible by m, and, similarly, 
f(x) = g(x) (mod m) 


when f(z) and g(x) are two polynomials with the property that the co- 
efficient of each power of x in f(x) is congruent (mod m) to the corre- 
sponding coefficient in g(x). For example, we would write 


δα — 2? + Qa — 1ΞΞ 4(5 + 4)? (mod δ), 
but 
«ὃ s& x (mod 5), 
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since in the latter case corresponding coefficients are not congruent 
(mod 5). Strictly speaking, perhaps, we should also write 


6 =1 (mod 5), 


but we customarily omit the fourth line when only numbers are involved, 
since such omission will cause no confusion. Using our notation, f(z) =0 
(mod m) implies that f(a) = 0 (mod m) for all integers a, but f(a) =0 
(mod m) for all integers a does not imply f(x) = 0 (mod m). 

We showed in the theory of equations that if f(z) is a polynomial in 
1 and if, when it is divided by (x — a), the quotient is q(z) and the re- 
mainder is R, then it may be written in the form 
(1) f(a) = ὦ — a)q(x) + ἢ, 
and R = f(a). This was called the remainder theorem. The correspond- 
ing result for congruences is expressed in 

Theorem 5.2. (The remainder theorem.) If f(x) is a polynomial 
such that 
(2) f(x) ΞΞ ὦ — a)q(z) + Καὶ (mod m), 
then R = f(a) (mod πη). 

To prove this, notice that the identity (1) implies the identical con- 


gruence (2) which must hold for all integer values of x. If we then replace 
x by a we get the desired result. 


5.3. The number of roots of a congruence 


We have seen examples of congruences which have more roots than 
their degree, for example, x? = 1 (mod 8) whose roots are 1,3,5,7.  How- 
ever, if the modulus is a prime number, we have 

Theorem 5.3a. If f(x) is a polynomial of degree n, the congruence 

f(z) = 0 (mod p) 
has no more than πὶ distinct roots, p being a prime number. 


Suppose the congruence has a root a. Then f(a) = 0 (mod p), and 
the remainder theorem shows that 


f(x) = ὦ — a)q(x) (mod p). 
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If ὃ is another root, f(b) = 0 (mod p) implies (Ὁ — a)g(b) = 0 (mod p), 
Then ὃ # a (mod p) implies g(b) = 0 (mod p). Thus, by the remainder 
theorem, q(x) ΞΞ (x — b)r(x) (mod p). Continuing in this fashion, we see 
that each root leads to a linear factor, and, since there can be no more 
than πὶ linear factors, our proof is complete. 

There are theorems for moduli which are powers of primes (Ref. 10, 
pp. 85-90), but they are too complex for inclusion here. We leave as an 
exercise the proof of the following theorem. 


Theorem 5.3b. Let f(x) be a polynomial of degree n less than p, where 
p is an odd prime, and suppose 


2? — x = f(x)q(x) + r(x), r(x) of degree <n. 


Then f(z) = 0 (mod p) has n distinct roots if and only if r(z) =0 (mod 
p). If f(0) 4 0 (mod p), 2» — x may be replaced by z?~! — 1 above. 


Exercises 5.3 


1. Is the following true? Suppose a 4 0 (mod m). Then if az? + br + 
6 = 0 (mod m) has one root, it has two? If your answer is “yes,” prove 
it; if “‘no,” give an example. Would your conclusions be the same if the 
modulus were a prime? 


2. A polynomial equation of odd degree with real coefficients must have 
at least one real root. Give an example of a cubic congruence with no 
roots. 


3. Where does the proof of Theorem 5.3a break down when p is not a 
prime number? Does the theorem hold for a power of a prime? 


4. Show that x? = a (mod p) has two roots or none if a is not divisible by 
p (p being an odd prime), and hence there are (p — 1)/2 values of a for 
which z* = a (mod p) has a solution. If the congruence is solvable, a is 
called a quadratic residue of p; if not, a nonresidue. 


5. If Risa quadratic residue of p and N a quadratic nonresidue, show 
that RN is a quadratic nonresidue for p an odd prime. - 


6. Let Ri, R2,---, Ri, k = (p — 1)/2, be the quadratic residues of an 
odd prime p, and N a quadratic nonresidue. Show that NRi, NR, --- , 


qi 
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NR, are the quadratic nonresidues. Show that if N: and N, are any two 
nonresidues, the congruence Niaz? = N, (mod p) is solvable. 


7. Using Fermat’s Theorem, show that 
ar t— -- (ἡ — 1)(4— 2) --- (@9 --, p+ 1) = 0 (mod p) 


of degree Ὁ — 2 has p — 1 roots and hence, by Theorem 5.3a, has all its 
coefficients divisible by p, p being an odd prime. Show that this implies 
Wilson’s Theorem. See Sections 2.5 and 2.11. 

8. Prove Theorem 5.3b. 


9. In a manner analogous to the definition of 7 = ἡ -͵α, define for a 
given prime p and a quadratic nonresidue N, a number 7 such that 7? = Ν. 
Show that every quadratic congruence zx? + ba - ὁ =0 (mod p) has a 
solution of the form r+ sj, where r and 8 are integers. Hivr: Write 
w+ ba +ec= (x + b/2)? + c — b?/4 and see that —c + b?/4 = 7? (mod p) 
for some integer 7, or else, using Exercise 6, —c+ 0?/4 = Nr = 7° 
(mod p) for some r. 

10. Show that 2? = 2 (mod 7) has no solution of the form r + sj, where 
rand s are integers and j is defined so that 72 = 3 (mod 7). 

11. Show how the solutions of the congruence f(z) = 0 (mod m) could 
be obtained from the solutions for the prime power factors of m. 
12. Show that az*-+- by? =c (mod p) is solvable if (ab,p) = 1. 
ercise 9 of Section 2.5. 


See Ex- 


5.4. Power residues 


The above exercises show that not much of the usual theory of 
equations carries over to the solution of polynomial congruences in num- 
ber theory. Perhaps this was to be expected, for we noticed in the be- 
ginning that the problem of divisibility which is trivial for real numbers 
is a matter of concern in the theory of numbers. So the solution of 
x? = αὶ in complex numbers is always possible, but the corresponding 
congruence (mod p) often has no solution. A definite criterion is given 
by 

Theorem 5.4. If pis a prime and (p,a) = 1 the congruence 

χ" = a (mod p) 
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is solvable if and only if at = 1 (mod p), where 6 is defined by (p — 1)/e = 
(n, Ὁ — 1). When it is solvable there are (p — 1)/e solutions. 

We postpone until later the proof for all n, being content for the 
present with the cases when ἢ is a divisor of p — 1 or whena = 1. In th 
first case n, a divisor of (p — 1), implies (n, p— 1) =n andne=p—1. 
Then, if we notice that x” — αὐ is divisible by 2" — a, we can write 


ane — αὐ = (τ" — α)ᾳ(α), 

where g(x) is a polynomial with integer coefficients. Hence 
ἀν 1 =a” -- αὐ - αὐ -- 1 = (2*—a)g(z) + αὐ --᾿Ἰ. 
If χ5-- a = 0 (mod 7p) has any solution 2, Fermat’s Theorem and the 
above identity show that 
(2x — a)q(a) + α΄ -- 1 = 27 *— 1 = 0 (mod p), | 

and hence 

at — 1 = 0 (mod p). } 

Conversely, if the last congruence holds, Theorem 5.3b shows that 
χ" — a = 0 (mod p) 


has n solutions. This completes the proof for the case when n divic 38 
—1. | 
᾿ We can also prove the theorem when a = 1 without restricting n to Ὁ 2 
a divisor of p—1. To do this, let k = (η, Ὁ -- 1). We know that the Z 
are integers r and s such that mr + (p — 1)s = k. Then x» = 1 (mod p) 
and zp! = 1 (mod p) imply 
gart(e-V)e == χὰ = 1 (mod p). 


Conversely, if k divides n and χὰ = 1 (mod 7), it follows that 23 = ' 
(mod p). What we proved in the preceding paragraph then shows ᾿ " 
there are k solutions of z* = 1 (mod p). Hence the number of solutions 
of x" = 1 (mod p) is k, which is equal to (p — 1)/e. . 
7 


Exercises 5.4 


1. If p = 1 (mod 8), show that 2* = a (mod p) has 3 or no solutions for 
(p,a) = 1. Where there are solutions, express them in the form το, Τοῦ, ζοῦν 
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where v is independent of x) and 2 is one solution of the congruence. 


2. If p = 2 (mod 3), show that z* = a (mod 7) has exactly one solution. 
Express it in the form a’, where r is a fixed integer independent of a. 


3. Assuming Theorem 5.4, prove that zt = —1 (mod 7) for an odd prime 
p, if and only if p = 1 (mod 8); that is, all odd prime divisors of z* + 1 
are congruent to 1 (mod 8). Why does this show that there are in- 
finitely many primes congruent to 1 (mod 8)? 

4. By the method of Exercise 3, prove that for any positive integer r 
there are infinitely many primes of the form 


p=1+2.N,N =1,2,3,°°°. 


There is a famous theorem of Dirichlet which generalized the above re- 
sults. It states, in fact, that for any integers a and b without a common 
factor greater than 1 there are infinitely many primes of the form a + Nb, 
N =0,1,2,++-. In other words, if (a,b) = 1 there are infinitely many 
primes in the arithmetic progression 


a,a+b,a+2b,a+3b,---. 
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There is another question connected with power residues suggested 
by Fermat’s Theorem. Certainly there are often smaller powers of a than 
αν which are congruent to 1 (mod p), for example, 2* = 1 (mod 7), but 
for each prime p is there an integer a such that no power of a smaller 
than p — 1 is congruent to 1 (mod p)? For instance, the successive pow- 
ers of 3 (mod 7) are congruent to 3,2,6,4,5,1 (mod 7), and 3 is such a 
number. We call a number a, a primitive root (mod p) or a primitive root 
of pif p — 1 is the least positive power of a which is congruent to 1 (mod p) 
and prove 


Theorem 5.5a. Every prime p has ¢(p — 1) primitive roots, where @ 
is the Euler ¢-function. 

In fact, suppose we say that “‘a belongs to the exponent k (mod p)” 
if k is the least positive power of a which is congruent to 1 (mod p) and see 
that Theorem 5.5a is included in 
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Theorem 5.5b. If kis any divisor of p — 1, there are $(k) numbers 
(mod p) which belong to k (mod 7). . 

Theorem 2.6b implies that k must be a divisor of p—1. By Exercise 
17 in Exercises 2.9, we have seen that if a belongs to k (mod p) then a’ 
also belongs to k (mod p) if and only if (7, k) = 1. Hence, for each divisor 
k of p — 1, there are ¢(k) or no numbers belonging to k (mod p). If then 
we let p(k) stand for the number of integers belonging to k (mod p), we 
see that for each k, p(k) is either 0 or ¢(k). Since every number from 1 to 
p — 1 belongs to some k (mod p), we have 


p —1=p(1) + p(k:) + +++ + phy), 
where 1, ke, ks, ++, ky are the divisors of Ὁ — 1. However we showed in 
Theorem 2.10b that 
p—1= $(1) + φίζι) +--+ + φίλω). 
Hence ρ(κὺ = ¢(k,) for every k;, and our theorem is proved. 
For example, we have the following: 
TABLE I 
Ν 1 we ἃ ὅ. 6. .8 9 1ὄ 1 τ 
Belongs (mod 13)to|1 12 8 6 4 12 12 4 3 6 12 2 


Note that 4 = φ(12) numbers belong to 12; 2 = φ(θ) to 6; 2 = φ(4) to 4; 
2 = 6(3) to 3; 1 = $(2) to 2; and 1 = ¢(1) tol. ; 

The primitive roots of a prime p serve a very useful purpose in that 
they can be made a basis for a kind of logarithm which expedites con- 
gruential computation. Notice that from the table above, 2 1s a primitive 
root (mod 13). A table of the residues of the powers of 2 (mod 13) is 
below: 


TABLE II 
αὶ ἐδ νυ. ὅν. ced: » eed > BaP da Th 
Hie 4261506 Both δ δ᾽ Ὁ TMs 


The values of n are usually called the indices for the numbers 2” for the 
primitive root 2. We shall adopt the nomenclature of logarithms and 


write 
log,» a = ὃ to mean g’ = a (mod p) 
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for g, & primitive root (mod p). For instance, we would have from Table 
II, logs,i3 5 = 9. Any number except 0 (mod 18) will thus have a log to 
the base 2 (or any other primitive root). The following examples illus- 
trate the use of a table of indices or logs. 

EXAMPLE 1. Solve 3x = 5 (mod 13). Table II shows that 3 = 2", 
5 = 2° (mod 13). Hence our congruence may be written 2‘¢ = 2° (mod 13) 
and 2 = 25. = 25 = 6 (mod 13). How would you similarly solve 5a = 3 
(mod 13)? 

ΕἸΧΑΜΡΙῈ 2. Solve zt = 7 (mod 18); that is, zt = 2" (mod 13). If 
there were a solution, x would be a power of 2, say 12 ΞΞ 2", and then 
a4 = 2" and 25 = 2" (mod 13), which would imply (why?) that 4r = 11 
(mod 12), an impossibility, showing that there can be no solution. We 
could have deduced this from Theorem 5.4, since 12/4 = 3 and 78 # 1 
(mod 18). 

EXAMPLE 3. Solve 25 = 7 (mod 18); that is, τῇ = 2" (mod 13). Let 
7 = 2" and see that we must have 5r = 11 (mod 12). The only solution 
of the last congruence is r = 7, and hence the only solution of our given 
congruence is z = 27 = 11 (mod 13). 


Exercises 5.5 


1. Use Table II to find all solutions of zt = 9 (mod 13). 


2. What other primitive roots does 13 have beside 2? Form a table, 
such as Table IJ, for each such root. Use one table to solve the congru- 
ence in the previous exercise. 


3. Show how Table I could be constructed from Table II. 
4. Using primitive roots, prove Theorem 5.4. 


5. Show that χ᾽ = a (mod 2) is solvable if and only if log, ,a is divisible 
by (e, p — 1) for every primitive root g. 


6. How would you define a primitive root (mod p*) where Ὁ is a prime? 
Using your definition, find whether 9 has a primitive root. Does 8 have 
& primitive root? 


7. Find all the numbers that belong to ¢(m) for m = 3,9,27. 
8. Find all the numbers which belong to 2*-* (mod 2*) for k = 3,4,5,6. 
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It is natural to call α a primitive root of p* if ¢(p*) is the smallest 
power of a which is congruent to 1 (mod p). Exercise 6 above shows that 
8 has no primitive root but the following theorem holds: 


Theorem 5.6a. If p is an odd prime, p’ has a primitive root for every 
positive integer r. 

First, we find primitive roots (mod p’). Leta be a primitive root 
(mod p), and suppose a + kp belongs to ¢ (mod p*). Now, (a+ kp)‘ = 
1 (mod 7) implies t = 0 (mod p — 1), and since from Theorems 2.6a and 


2.60, ¢ also divides p(p — 1) = $(p”) it must be either Ὁ — 1 or p(p— 1). © 


If it is the former we have 


(a+ kp)? = 1 (mod p’); that is, 
(a+ kp)? ΞΞ α - kp (mod p’). 


Thus a? — a = kp (mod p’), which holds if and only if k = (a? — a)/p 
(mod p). Hence, for p — 1 values of k the last congruence fails to hold, 
and for these values a+ kp is a primitive root; that is, p? has (p — 1) 
¢(p — 1) primitive roots, since there are ¢(p — 1) primitive roots, a, of p. 
We shall show below that all primitive roots of p* are obtained in this 
manner. 

Before we consider primitive roots of higher powers of p, notice that 
% is the only primitive root of 3. The primitive roots of 9 are 2 and 5 but 


not 8. However, all the numbers congruent to 2 (mod 9) are primitive — 


roots of 27, and a similar statement may be made for numbers congruent 
to 5 (mod 9). This suggests that any primitive root (mod p*) may be 8 


primitive root for all higher powers of p. We now show that this is the. 


case. 
First, we need the following auxiliary result: 


Lemma: Let p be an odd prime and at = 1+ kp* (mod p**’) with © 


s > 1; then 
a’? = 1+kp't (mod pth), 


First, we prove the lemma for r= 1. The congruence of the hy- 
pothesis is equivalent to the equation 


at = 1+ kp + up™, 
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where wu is an integer. Raise both sides of the equation to the pth power 
and obtain 


a’? = (1+ kp? + up)? = 1+ p(kp* + up) + 
(p/2)(p — 1) (kp* + up)? + --- 
= 1+ kp (mod p™) , 


since all terms but the first two in the expansion are divisible by p**. 
This completes the proof for r = 1. 

To prove the lemma by induction assume it true for 7 replaced by 
r — 1; that is, assume that the hypothesis of the lemma implies 


a’? = 1+kptt (mod pt), 


If, in the hypothesis of the lemma, we replace ¢ by tp” * ands bys +r -—1, 
we have the last congruence. Hence, by the conclusion of the lemma for 
r = 1, it follows that 


(α΄ )» =1+ κρηῖτ τῖι (mod prtrtt), 


which is what we wished to prove. 

Second, every primitive root of ργ is a primitive root of p*, for if a 
belongs to ¢ (mod p*) we have a‘ = 1+ kp* and hence, by the lemma, 
a‘? =1 (mod p'); and if a is a primitive root of pt’, tp” = 0 (mod 
¢(p***)). However, 4(p*t") = p’¢(p*), and hence ἑ = 0 (mod ¢(p*)), anda 
is ἃ primitive root of p’*. 

Third, suppose a is a primitive root of p?. From what we have just 
proved it must be a primitive root (mod p), and hence a? = 1+ kp, 
where k is prime to p. Then, by the lemma with t = p — 1 ands = 1, we 
have 


(1) a?) = 14 kp (mod pr). 


Thus, if r=1, a®” =1+kp* (mod p*), which shows that a®”” #4 
1 (mod p*). If, however, a belongs to ¢ (mod p*), at = 1 (mod p*) implies 
t is divisible by ¢(p”), and hence ¢ must be ¢(p”) or ¢(p*). The former 
would contradict the previous incongruence. Hence ¢ must be ¢(p*), and 
a is a primitive root of p*. Similarly, (1) implies a*®” #1 (mod p‘), 
and thus a is a primitive root of p*. Continuing in this fashion, we see 
that every primitive root of p? is a primitive root of p’ for all r, and every 
odd prime p. 
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The number 2 seems to be a peculiar prime. For instance, the 
Euler phi-function of a power of p is a power of p only when p = 2, One 
other respect in which it differs in behavior from the other primes is that 
no power of 2 beyond 4 has a primitive root. Notice that 3 belongs to 
2 (mod 8), 4 (mod 16), 8 (mod 32), and so on. In fact, we have 


Theorem 5.6b. If ¢ > 3, an odd number 1 + 2n belongs to 25 2 (mod 
22) or a lesser power of 2 according as n = 1,2 (mod 4) or 0,3 (mod 4). 


} 
' 


This depends on the following auxiliary result, which we prove first. 


Lemma: If » 3, then 
( Ὁ 2n)* = 1+ 2**(n — n? + Qn‘) (mod 24. 


This lemma can be shown by expanding the left side by the bino- 
mial theorem, but this leads to certain complications which may be 
avoided if we elect to prove it by induction as follows. 

If t= 4, (1+ 2n)?=1+ 4n+ 4n? =1+4(n+ n°), which gives the 
formula of the lemma, since n+ n? = ἢ — n?+ 2n‘ (mod 4) for all ἢ. 
If t= 5, (1+ 2n)*= 1+ 4-2n+ 6-4n? + 4: 8η3 + 16n! 

= 1+ 8(n+ 3n? + 2n*) (mod 32) 
= 1+ 8(n — "2 + 2n‘) (mod 82). 


_ We now assume the lemma for ¢ > 5 and seek to prove it for i+ 1. 
That is, we may assume 


( Ὁ 2n)2 = 1+ 22(n — n? + 2n') + 24, 
where 7 is an integer. Squaring both sides, we get 
(1+2n)* =1+2*(n — n?+ 2n‘) (mod 2"), 
since 2(έ -- 2) > t+ 1 when ¢ > 5. This completes the induction and 
proves the lemma. 

To prove the theorem, notice that if n =0 or 3 (mod 4), n—n?+ 
2n* = 0 (mod 4) and hence (1+ 2n)? = 1 (mod 2*) and 1 + 2n belongs 
to a power less than 25 2 (mod 2"). On the other hand, if n = 1 or 2 (mod 
4), ἢ -- πῷ -Ὁ 20. = 2 (mod 4) and (1+2n)?>) =1+2** £1 (mod 25. 
However, (1+ 2n)* = 1 (mod 2‘). Hence we have 


Corollary 5.6b. If ¢ > 4, m belongs to 2: “2 (mod 2) if and only if 
m = +3 (mod 8); otherwise m belongs to a smaller power. If ¢ = 3, the 
numbers 3,5, and 7 all belong to 2 (mod 8). 


| 
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‘Theorem 5.6c. If we define a primitive root (mod m) to be a number 
a such that $(m) is the least power of a which is congruent to 1 (mod m), 
then m has a primitive root if and only if m is one of the numbers 


2, 4, p", 2p", 


where p is an odd prime. We leave the proof as an exercise. 
Exercises 5.6 


1. Are there values of 8 and r such that 5° = 3* (mod 2°), whereO <r < 2! 
and 0 <s < 2#? 


2. Since Corollary 5.6b shows that 5 belongs to 2‘~* (mod 2°), why does it 
follow that δ΄ takes on exactly 2° different values (mod 22) as r ranges 
over the positive integers not greater than 2°? Hence show that for 
every odd number a there is a positive integer r such that, for the proper 
choice of sign 


a = +5’ (mod 22,0 <r < 2, 


Also show that r (with these restrictions) and the sign are unique. Would 
the same be true if 5 were replaced by 3? 


3. Suppose m = 2'op tp 2- . ps, where the p; are distinct odd primes. 
Let go be chosen so that 


go = 5 (mod 8), go = 1 (mod m/24), 
the ambiguous sign + so that 
a = +1 (mod 4), 
and, for each positive 7, choose g; a primitive root (mod p,) such that 
9: =1 (mod νι ἢ). 


Why is such a choice possible? Show that for every integer a, prime to m, 
there are positive integers r; < o(p.') and an integer ry which is zero or 
not greater than ᾧ — 2, such that 

a = -£9,",19,? «ρ΄ (mod m), 


for the above choice of ambiguous sign. Hriyr: Show that the last con- 
gruence holds for each prime power dividing m. Furthermore, show that 
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the σε and a uniquely determine the τὶ and, if m is divisible by 4, the am- — 


biguous sign. 
4. Show that if m is factored as in Exercise 3 and a is any number prime 
to m then a* = 1 (mod m) where k is the least common multiple of 


2", $(p,! or "ἢ $(p,'). 


Is there a number a such that this k is the least power congruent to 


1 (mod m)? Under what conditions is k = ¢(m)? 

5. Prove Theorem 5.6c. 

6. Find the number of primitive roots of ρ΄, for p an odd prime. 
7. How many numbers will there be belonging to 2. 2 (mod 2‘)? 


Quadratic Residues 


6.1. Summary of previous results 


Recall that we have called a a quadratic residue (mod m) if (a,m) = 1 and 
x? = a (mod m) 


has a solution (see Exercises 5.3). If (a,m) = 1 and the congruence is 
not solvable, we call a a quadratic nonresidue or, simply, nonresidue. If 
(a,m) ¥ 1, ais neither a quadratic residue nor a nonresidue. 

In the same exercises we showed that each odd prime number p has 
(p — 1)/2 quadratic residues and the same number of nonresidues. This 
could also be seen from the fact that the even powers of a primitive root 
of p are quadratic residues and the odd powers are the nonresidues. Why? 

Furthermore, an immediate consequence of Theorem 5.4 is 


Theorem 6.1. (Euler’s Criterion.) If (a,p) = 1 and pis an odd prime, 
then a is a quadratic residue (mod p) if 


a-D/2 = +1 (mod p), 
and a nonresidue if 

αΟ-Ὁ Ὁ} = —] (mod 7). 
One of these congruences must hold if (a, p) = 1. 


Corollary 6.1. If p = 1 (mod 4), then —1 is a quadratic residue of 
p; if p = —1 (mod 4), it is a nonresidue. 
The proof of the theorem and corollary is left as an exercise. 
119 
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Exercises 6.1 


1. Show why Theorem 6.1 follows from Theorem 5.4, and why the corol- 
lary follows from the theorem. 

2. Show that the even powers of a primitive root of p are quadratic residues 
(mod p) and the odd powers nonresidues. 

3. Show that the statement in Exercise 2 remains true if p is replaced by 
p’, where now p is an odd prime. 

4. If ais prime to p, why must one of the congruences of Theorem 6.1 
hold? 

5. Show that ὃ is a quadratic residue of an odd prime Ὁ if and only if it is 
a quadratic residue of all powers of p. There are at least two methods of 
proof, suggested as follows: 


a. Use Exercise 3, above, and the fact that any primitive root of p’ is 
a primitive root of p. 

b. Show that if x is a root of x? = a (mod p), then k may be determined 
so that (%) + kp)? = a (mod p*), and so on for higher powers of p. 


6. Show that ὃ is a quadratic residue of 2‘, ¢ > 3 if and only if b=1 
(mod 8). Two methods of proof are 


a. Show that the even powers of 5 (mod 2‘) are congruent to the num- 
bers congruent to 1 (mod 8) and less than 2‘, in other words, that 
ὃ = 1 (mod 8) implies 5% = ὃ (mod 2°) is solvable for 2. 

b. Show that if x is a root of xz? = a (mod 8) then k may be determined 
so that (a + 4k)* = a (mod 16), and so on for higher powers of 2. 


6.2. The Legendre Symbol 


The results of Exercises 5 and 6, above, show that whether or not a 
number is a quadratic residue of a power of a prime depends on the same 
information for the prime itself. For purposes of easy reference we list 
these results as 


Theorem 6.2a. A number aisa quadratic residue of p" if and only if 
1. It isa quadratic residue of p for p odd, 

2. a = 1 (mod 8) if p = 2, r 2 3, 

3. a@ = 1 (mod 27) if p= 2 andr = 1 or 2. 
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The Chinese Remainder Theorem shows that the quadratic residues of a 
composite number may be found from the quadratic residues of its prime 
power factors. We leave as an exercise the proof of 


Theorem 6.2b. The number ὃ is a quadratic residue of a composite 
number m prime to b if and only if it is a quadratic residue of each odd 
prime factor of m and, in addition, 


ὃ = 1 (mod 4) if m = 0 (mod 4), 
b = 1 (mod 8) if m = 0 (mod 8). 


Though Euler’s Criterion gives a way of determining the quadratic 
character of a number (mod 7), that is, whether or not it is a quadratic 
residue, it is cumbersome in practice. There is a much more elegant 
method embodied in the Quadratic Reciprocity Law which we shall prove 
later in this chapter. The first step on the road to this result is the 
Legendre Symbol (a|p), defined for p an odd prime as follows: 


(a|p) = 1 if ais a quadratic residue of p, 
(a\p) = —1 if ais a quadratic nonresidue of p, 
(alp) = 0 if a = 0 (mod p). 
ς t + : a ὕω " ᾿ 
This symbol was originally written (*), but the notation given above is 
coming into vogue and is much more popular with the printers. There 


is a certain elegance about this symbol, since it has the properties listed 
below. 


Theorem 6.2c. The Legendre Symbol has the following properties: 


1. If (a, p) = 1, (a\p) = a®—? (mod p) by Euler’s Criterion. 
2. The symbol is multiplicative, that is, 


(a|p) (b|p) = (ab|p). 
3. If a = ὃ (mod p), then (a|p) = (Ὁ). 
4, If (r, p) = 1, then (ar*|p) = (a|p). In words, a square factor prime 
to p may be deleted from the left portion of any symbol without 
altering its value. 


The proof is left as an exercise. 
The Legendre Symbol is a particular example of what is called a 
character, which could be defined as follows: Let g be a primitive root of 
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p, and let p be a number for which p?~! = 1; that is, p is a p — 1 root of 
unity. Then define the symbol 

(alp), 


to be zero if (a,p) τέ 1, and ρ' if (a,p) = 1, where ¢ = log, ,a. You must 
recall that ¢ = log, ,a was defined to be that unique positive integer less 
than p for which a = gt (mod p). For the Legendre Symbol, p is —1, and 
the value of the character is independent of g. Usually the value of the 
character depends not only on p but on g. 


Exercises 6.2 


1. Prove Theorem 6.2b. 


2. If p =1 (mod 3) and p = w, an imaginary cube root of unity, show 


that the character (a|p), is not independent of the primitive root of Ὁ 
but that if (alp). = 1 for one primitive root it is equal to 1 for every 
primitive root. 

3. Prove Theorem 6.2c. 

4. Prove that properties 2 and 3 of Theorem 6.2c hold for the symbol 
(alp),. 

5. Prove that (1|p) + (2|p) + --: + (p— 1|p) = 0, if pisa prime. The 
same result may be shown to hold for the character defined above. 


6.3. Gauss’s Lemma 


There is a lemma of Gauss which gives a rather curious way of find- 


ing the quadratic character of a number (mod p). From a computational — 


point of view it is not very useful, but it has at least two claims to im- 
portance: First, it gives a means of determining the primes of which 2 is 
a quadratic residue; and second, it is an important step on the road to the 
Quadratic Reciprocity Law whose proof is the goal of this chapter. 

Before stating the lemma formally, let us see how it works in a par- 
ticular example. Suppose we wish to determine whether or not 7 is a 
quadratic residue of 17. Write 


1-7, 2.7, 3-7, 4-7, δ.7, 6°7, 7.1, 8-7, 


= 
δ οἷ τ —————— 
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that is, the first (p — 1)/2 multiples of 7, where p = 17. Their least posi- 
tive residues (mod 17) are in order 
7, 14, 4, 11, 1, 8, 15, 5. 

Notice that each residue is derived from its predecessor by adding 7 and, 
if necessary, reducing (mod 17). Now, three of these are greater than 
17/2, and Gauss’s Lemma affirms that (7|17) = (—1)* = —1; that is, 7 is 
a nonresidue of 17. In a similar fashion, 13 can be shown to be a quad- 
ratic residue of 17. 


Theorem 6.3a. (Gauss’s Lemma.) If p is an odd prime, (q,p) = 1, 
and τ is the number of least positive residues of the set 


(1) q; 2q, 3g, Δεν: γί -- 1)/2 
which are greater than p/2, then 
(qlp) = (--1)". 


Let δι, be, «++, bs be those least positive residues (mod p) of the set 
(1) which are less than p/2 and ci, cs, --+, συ those which are greater than 
p/2. Then u+t= (p— 1)/2, and the numbers 


(2) Oy, Os, «++, By, D — Cr, D— Cs, ° °° τ΄ Cy 


are all positive and less than p/2. 

If we could show that the numbers of (2) were all distinct (mod p), 
they would have to be 1, 2, ---, (p — 1)/2 in some order, and we would 
have 


q(2q)(3q) «++ φίφ — 1)/2 = bibs -- + becxer- ++ Cy 
= (—1)"bib.--- bi(p Ἕ οι)» - δ) 5" φ -- δι) 


= (-1)(2 ==) 1 (mod p). 


That is, 


go-va(P —*) l= (-1)4(? *) ! (mod p). 


Since (2): is prime to p, we may divide both sides of the congruence 
by it and have 

q?-Y? = (—1)* (mod p). 
Use of Euler’s Criterion then would complete the proof. 
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It remains to show that the numbers of (2) are distinct (mod 7). 
Suppose ὃ; = ὃ; (mod p), with z #7. Then b; = rq, ὃ; = sq (mod 7p) with 
r #8 would imply rq = sq (mod p), and hence r= 8 (mod p). This is 
impossible, since r and s are positive, distinct, and less than p/2. Sim- 
ilarly, p—c; #p—c; (mod p). Finally, if b; = p—c,, where ὃ; = rq, 
6; = sq (mod p), we would have rg = —sq (mod p); that is, r = —s (mod p), 
or r+s =0(mod p). This is impossible, since both r and 8 are positive 
and less than p/2. Thus the proof of the theorem is complete. 

The first use we make of this theorem is to show 


Theorem 6.3b. If p is an odd prime, 
(2|p) = (-I)"*; 


that is, (2\p) = 1 if p = +1 (mod 8), and (2|p) = —1Lif p = +3 (mod 8). 
To show this, let g = 2 in Theorem 6.3a, and the set (1) becomes 


(3) 2, 2.2, 3-2, +++, 2(p — 1)/2. 


These numbers are all less than p and hence are all least positive residues 
(mod p). Hence we must determine how many are greater than p/2. 
First, if p = 1 (mod 4), then 


2(p — 1)/4 < p/2 and 2-(p + 3)/4 > p/2. 


Hence u= (p — 1)/2 — (p— 1)/4 = (p — 1)/4; thus τὸ is even if p = 
1 (mod 8), and odd if p = —3 (mod 8). Second, if p = —1 (mod 4), then 


2(p — 3)/4< p/2 and 2(p + 1)/4 > p/2. 


Hence τ = (p — 1)/2 — (p — 3)/4 = (p + 1)/4; thus τὸ is even if p= 
—1 (mod 8), and odd if p = 3 (mod 8). Hence wis evenif p = +1 (mod 8), 
and odd if p = +3 (mod 8). In the former case (p? — 1)/8 is even, and 
in the latter it is odd. This proves the theorem. (For other proofs of 
this theorem, see Ref. 14, pp. 278 ff.) 

. There is another criterion for quadratic character which takes us 
further along the way to our ultimate goal. It is 


Theorem 6.3c. If 
M = [q/p]+[2¢/p]+ --- ἜΤ -- 1)¢/2p], 
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that is, if M is the sum of the greatest integers in the numbers of set 
(1) of Theorem 6.3a, divided by p, then 


u = M + (ρ" -- 1)(¢— 1)/8 (mod 2), 
where τὸ is defined in Theorem 6.3a, and p is an odd prime. Recall 


(qlp) = (--1)". 
First notice that 


ig = p-[iq/p] τῳ 


where 7; is the least positive residue of 7g (mod p) and hence is one of the 
b,’s or c,’s of Theorem 6.3a. Thus 


(p—1)/2 
a iq=pM+ritrete ss $744 
= = pM +bitbet::: tbe teteet +++ +e, 
=pM +bit-->+b:+ up — (p— οὐ — (p— es) — +> 
τῷ — ¢) 


= pM + up + Dib + Dip — οἡ (mod 2). 
Then, since the numbers of (2) are 1, 2, ---, (Ὁ — 1)/2 in some order and 
p is odd, 
> ig = M+u+t >i (mod 2), 
u=M+ Σὲ -- φ' ΣᾺ (mod 2), 


all sums being over 7 from 1 to (p -- 1)/2. Each of the sums in the last 
congruence is, by the formula for the sum of an arithmetic progression, 
equal to (p? — 1)/8. Hence 

u=M-+ (1 — q)(p? — 1)/8 (mod 2), 


and the theorem is proved, since 1 — g = g — 1 (mod 2). 


Exercises 6.3 


1. Show that Theorem 6.3c implies Theorem 6.3b. Show that (m? — 1)/8 
is even if and only if m = +1 (mod 8). 

2. Of what primes is —2 a quadratic residue? 

3. Use Theorem 6.3c to find the quadratic character of 13 with respect 
to 17. 
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4. Show that if ᾳ = 3 in Theorem 6.3c then M = (p — 1)/6 or (p + 1)/6 
according as p = 1 (mod 8) or p = —1 (mod 8), hence that 


u = (p— 1)(8p + 5)/12 or (p + 1)(8» — 1)/12 (mod 2) 


in the respective cases. This shows that 3 is a quadratic residue of an odd 
prime p if and only if 


p = 1 or 11 (mod 12). 


5. Of what primes is —3 a quadratic residue? 


6. We showed in Exercise 3 of Exercises 5.4 that there are infinitely 
many primes of the form 8N + 1. If “ἢ — 2 is divisible by an odd prime, 
the prime must be congruent to +1 (mod 8), since 2 is a quadratic residue 
of only these primes; not all the prime factors of z? — 2 can be = 1 (mod 8), 
since for x odd, “ἢ -- 2 = —1 (mod 8). Hence, for each odd z, x? — 2 has 
@ prime factor Ὁ = —1 (mod 8). Why does this imply that there are 
infinitely many primes of the form 8N —1? Devise similar proofs for 
the forms 8N + 3, 8N + 5. 


7. Of what primes is 6 a quadratic residue? 


6.4. The Quadratic Reciprocity Law 


This law has been characterized as one of the most beautiful results 
in the theory of numbers. It was stated by Euler in 1783 and first proved 
by Gauss thirteen years later, who gave several proofs. There have since 
been many proofs. Because the elegance of the result is not at first ap- 
parent, after stating it we shall first show how it is used. 

Theorem 6.4a. (The Quadratic Reciprocity Law.) If p and gq are 
two distinct odd primes, then 


(qlp) (pig) = (-- 1), 
where t = (p — 1)(q—1)/4. That is, 
| (qlp) = (p|q) if p = 1 (mod 4) or g = 1 (mod 4), 
(qlp) = — (pla) if p = ᾳ = ὃ (mod 4). 


The second form of the law, which is easier to use in practice, fol- 
lows from the first, since ¢ is even unless both p and g are congruent to 
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3 (mod 4). First, let us see how the law applies to the evaluation of 
(35|71). By property 2 of the Legendre Symbol this is equal to (5/71) 
(7|\71). The Quadratic Reciprocity Law and property 3 of the Legendre 
Symbol gives the following: 
(671) = (716) = (16) = 1, 
(7|71) = —(71|7) = —(1|7) = -1, 
and the product of the two values shows that 35 is a nonresidue of 71. 
We shall later remove some of the restrictions on p and gq with the result 
that computation can be shortened to (35/71) = -- (7185) = —(1|35) = 
—1, but we first shall prove our result when both numbers in the symbol 
are primes. 
To prove the law, notice that q odd implies from Theorem 6.3c that 
u = M (mod 2) and hence 
(qlp) = (-1)™. 
Thus, if N is that sum obtained from M by interchanging p and q, we have 
(glp) (pla) = (-1)**". 


Hence it remains to prove that 
(4) ΜΌΝ = (p-— l)(q-- 1) /4. 

We shall use Eisenstein’s geometric proof, which is as follows: On 
a rectangular coordinate system, let the line L connect the origin with 
the point (p, q). Let R denote the region bounded by the lines x = p/2, 
y = q/2 and the z and y axes, not including any of the bounding lines. 
(See the illustration for p = 11, g = 5.) 


BB Bay A 3. Ὁ ae 
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The proof consists in counting in two ways the lattice points (that is, 
the points with integral coordinates) within the region R. 

The simplest way to count them is to notice that the lattice points 
form a rectangle whose horizontal dimension is (p — 1)/2 and whose 
vertical dimension is (¢q — 1)/2. Hence R contains in its interior 


p-1 q-1 
3 Ὁ 


lattice points. We then shall establish (4) by showing that in R there are 


1. No lattice points on L, 
2. M lattice points below ZL, 
3. N lattice points above L. 


Hence it will follow that there are M + N lattice points in R and M+ N = 
(p — 1)(q- 1)/4. 

First, since the equation of L is py = gx and p and q are relatively 
prime, p must divide the z-coordinate of any lattice point on L, and q 
must divide the y-coordinate. Hence there is no lattice point on L closer 
to the origin than the point (q,p). This point is outside R. Therefore L 
contains no lattice points within R. 

Second, on the line « = 1 the y coordinates of the lattice points be- 
low L in R are 1, 2, ---, [¢/p]; that is, there are [¢g/p] lattice points in R, 
below Z and on 2 = 1. Similarly, there are [2¢/p] lattice points in R, be- 
low L and on x = 2. Hence the total number of lattice points in R and 
below L is 


La/p] + [2q/p]+ --- ἜΤ — 1)q/2p], 
which is just M. 
Third, the proof that N is the number of lattice points in R and 
above L is left as an exercise. 


Notice that an algebraic method of proof would be to consider the 
numbers 


(5) Py — Qt, 


as y takes on the values 1, 2, ---, (ᾳ -- 1)/2, and z = 1, 2, ---, (» — 1)/2. 
The number of numbers of (5) is (p — 1)(q — 1)/4 (whether or not they 
are distinct is immaterial). As above, we can show that none are zero. 
It can be shown that N of (5) are positive and M negative and hence that 
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(4) holds. The connection with the geometric argument may be seen, 
since the line L is py — qx = 0, and if (a, y) is any point above L, y > qu/p, 
that is, py — gz > 0 and, for points below L, py — qx < 0. 

Exampte 1. Find (71|73). Now (71/73) = (73|71) since 73 = 1 
(mod 4) and (73/71) = (2/71) by property 3 of the Legendre Symbol. 
Finally, (2/71) = 1 by Theorem 6.3b, since 71 =7 (mod 8). Hence 
(71|73) = 1, and 71 is a quadratic residue of 73. One could also evaluate 
the symbol as follows: (71/73) = (—2|73) = (—1|73)(2|73) = 1-1 = 1, by 
properties 3 and 2 of the Legendre Symbol, Corollary 6.1, and Theorem 
6.3b. 

Exampie 2. Find (55/73). Here (55|73) = (—18|73) = (—2|73) by 
properties 3 and 4 of the Legendre Symbol. We have just shown (—2|73) 
to be 1 and hence (55/73) = 1. 

Examptes 3. Find (186/301). Now (186/301) = (2|301)(3|301) 
(31|301). 


(2|301) = —1, since 301 = 5 (mod 8), 
(31301) = (8013) = (1/3) = 1, 
(31/301) = (301/31) = (22/31) = (—9|31) = (—1|31) = -1. 


Hence (186/301) = 1, and 186 is a quadratic residue of 301. 


Exercises 6.4 


1. Compute the following quadratic characters: 
(17|89), (73|89), (10137), (29]101). 
2. Use the Quadratic Reciprocity Law to do Exercises 4 and 5 of the 
previous section. 
3. Is 7 a quadratic residue of 65? Is 14 a quadratic residue of 65? 


4. In the proof of (4) show that N is the number of lattice points in καὶ 
and above L. 


5. Complete the algebraic proof of (4). 


6.5. The Jacobi Symbol 


There is an extension of the Legendre Symbol which shortens the 
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computation. The Jacobi Symbol is defined as follows for odd numbers 
P = pip. +++ py, Where the p’s are primes not necessarily distinct: 


(m|P) = (m|p1)(m|p2) - - - (mp). 


First, it should be noted that (m|P) = 1 does not imply that m isa 
quadratic residue of P. For instance, (765) = (7|5)(7|13) = 1, since 
(7|5) = (7|18) = —1. But the latter shows that 7 is not a quadratic resi- 
due of 5 and hence cannot be one of 65, since 22 = 7 (mod 65) solvable 
would imply z? = 7 (mod 5) solvable. For m to be a quadratic residue of 
P it must be one of each factor of P. However, if (m|P) = —1, one of the 
symbols (m|p:) must be —1, and hence m is a nonresidue of P. Thus 
(m|P) = —1 implies that m is a nonresidue of P, but if (m|P) = 1, then 
m is necessarily a quadratic residue of P only if P is an odd prime. 

The usefulness of the Jacobi Symbol depends on its having the six 
properties of manipulation of the Legendre Symbol, which we list in 


Theorem 6.5. The Jacobi Symbol (a|/P) for odd positive integers 
P has the following properties: 


. (a|P)(b|P) = (ab|P). 

. If a = b (mod P), (a|/P) = (Ὁ). 

. If (r,P) = 1, (ar*|P) = (a\P). 

. (ΞΡ) = (-p?-™. 

. ΩΡ) = (—1) PP, 

. (P|Q)(Q|\P) = (—1)!, where t = (P — 1)(Q -- 1)/4. 


The proofs of the first three properties are left as exercises. 
To prove Property 4, see that 


ao ork WN μ 


k 
(—1|P) = (—1|p1)(—1|p2) +++ (—1|px) = (—1)", r= Σ, (pi -- 1)/2. 
We have to show that r is of the same parity as (P — 1)/2. However, 
(P —1)/2= (-14+ pips -:+ p,)/2 = (-1+ 7 pd, 
i=] 


where Π' p: is the abbreviated notation for the product of the p’s, analogous 
ἐπὶ 


to the Σ notation for their sum. Now, 
k 
k 
πρὶ hut (op: —1+1) = 1+ dX (p; — 1) (mod 4), 


| 
| 
ἢ 
| 


| 
| 
d 
| 
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since any product of p; — 1 and p; — 1 is divisible by 4. Hence 
k 
(-1 + p)/2 = D (ps — 1)/2 (mod 2), 


and property 4 is proved. 
Property 5 is proved similarly, and this we leave as an exercise. 
To prove Property 6, notice that 


(P|Q) = x (pila), 


the product being over all prime factors p; of P and prime factors q; of Q, 
each repeated as often as it occurs in P or Q. Hence 


(P|Q)(Q|P) = ᾿ (φήαὴη(ᾳἠρὸὴ = (-1)4, 


where 
ΚΕ Υ ΣΤ EAs) μὰν 


But we have just shown above that the first sum on the right is congruent 
to (P — 1)/2 (mod 2) and the second sum to (Q — 1)/2 (mod 2). Hence 


(P|Q)(Q|P) = t = (P — 1)(Q — 1)/4 (mod 2), 


and property 6 holds. 
Two examples will show the advantages of the Jacobi Symbol. 
Exampte 1. Is 85 a quadratic residue of 97? Using the Legendre 


‘Symbol would require evaluating (5/97) and (17|97); each would be seen 


to have the value —1, and hence their product is 1. Using the Jacobi 
Symbol, we have 


(85/97) = (97/85) = (12|85) = (8/85) = (85/8) = (18) = 1. 


Notice that we have shown that (85/97) = 1 and hence that 85 is a quad- 
ratic residue of 97, since the latter is a prime. On the other hand, (12/85) 
= 1 does not imply that 12 is a quadratic residue of 85. In fact, 12 isa 
nonresidue, since (125) = —1. 

Examp.e 2. Find (171/173). It is equal to (173171) = (2|171) = —1. 
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1. Use the Jacobi Symbol to compute the quadratic character of a with 
respect to m for the following pairs of values: 


1 ἃ 51 93 535 781 | 1303 | 
am | 103* | 461* | 877* | 23805 2631 
The values of m with asterisks are primes. 


2. Prove Properties 1, 2, 3, and 5 of Theorem 6.5. 


3. Define an extension of the Jacobi Symbol as follows: If a is positive, 
define (a|—P) to be equal to (a|P). Show that this symbol has the prop- 
erties listed in Theorem 6.5, provided that in no symbol are both quanti- 
ties negative. Why is this restriction necessary? 

4. Use the results of Exercise 3 to find the primes of which —3 is a quad- 
ratic residue. 

5. Find the conditions that the congruence 


αχἢ + bx + ὁ = 0 (mod p) 


be solvable. 
6. Show that 2* =a (mod p’) is solvable for all primes Ὁ and all integers 
¢ if and only if a is the square of an integer. 
7. Show that zx? =a (mod m) is solvable for (a,m) = 1 if and only if 
a = b’c, where c has no square factors and is a quadratic residue of m. 
8. If P is an odd number, show that Property 1 of the Jacobi Symbol im- 
plies that 

(m|P) Σὺ (ΠΡ) = d GIP), 
where m is any integer prime to P and the sum is over all positive values 
of ὁ less than P. Hence show that if m can be chosen so that (m|P) ¥ 1 
(when may this be done?) it follows that 


DX GP) = 0. 
Does this mean that any composite number P has just as many quadratic 
residues as nonresidues? 


9. Using the method suggested in the previous exercise, show that if 
(a|p), is defined as in Section 2 and is not 1 for all a, then 


25 (z|p) = 0, 


where the sum is over all positive ὁ less than p. 
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10. Find all the values less than 40 taken on by the expression 
e+ 


for x and y positive integers. Record how many times each value occurs, 
and guess what numbers in general are represented by this expres- 
sion. 


6.6. Sums of two squares 


‘In this section we find what numbers are the sums of two squares, 
that is, for what integers m, the equation 


(1) v+y=m 


is solvable in integers x and y. The question may be reduced to consider- 
ation of m, a prime, by the following: 


Lemma 6.6a. If m and n are each sums of two squares, so is 
m:n. 
To prove this, let a?+b? =m, ?+d?=n. Then m-n= (a? + δ)" 
(c? + (Ὁ = (ac + bd)? + (ad — be)? 

The following is also useful: 


Lemma 6.6b. If Ὁ is an odd prime dividing m in (1), then 
xz = y = 0 (mod p) or p = 1 (mod 4). 


Suppose p = 3 (mod 4) and 2? + y? = 0 (mod p), with y prime to p. 
Then y’y = 1 (mod p) would be solvable for y’, and (y’x)? + (y’y)? = 0 
(mod p) implies (y’x)? = —1 (mod p), whereas —1 is a quadratic nonresidue 
of p. This shows that m cannot be a prime congruent to 3 (mod 4), in fact, 
cannot even be congruent to 3 (mod 4), since then it would have such a 
prime factor. 

On the other hand, all primes congruent to 1 (mod 4) are the sums 
of two squares, as is shown by 


Theorem 6.6a. If p is a prime congruent to 1 (mod 4), it can be ex- 
pressed as a sum of two squares in essentially one and only one way, that 
is, aside from the order and signs of z and y. 

The fact that such a prime is a sum of two squares will follow im- 
mediately once the truth of the following statement is established: If s is 
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an integer such that s? = —1 (mod p), (there is such an 8 since p = 1 
(mod 4) implies (—1|p) = 1), then the congruence 
(2) % = sy (mod p) 
has a solution 2 , yo with |2o| < Vp > |yo|. (See Exercise 9 in Section 2.5.) 
To prove this statement, let x and y range over the values 
0,1,2, ἐτὴ [Vp]. 
Then x — sy takes on 
- 2 
{[Vvp]+ 1} 

values, that is, more than p different values. Hence two of them must be 
congruent (mod p), and we have 

ὧι — Sy1 = L2 —8y2 (mod p). 
Thus, if we set % = σι — 22, Yo = Y1 — Ya, We see that x» and yo form a 
solution of the congruence (2) and z? and y? are both less than p. This 
completes the proof of the statement. 

For such a solution of congruence (2), we have 
0 = (0 — syo)(o + syo) = 22 — sy? = 22 + γῇ = 0 (mod p). 
That is, 23+ yi = kp. Furthermore, the above statement implies 22 < p, 
y2 < pand hence 2? + y2 < 2p. Thus k = 1 and pis the sum of the two 
squares x? and y?, (Ref. 10, pp. 188 ff.) 
To show the uniqueness of the solution, suppose 
(3) aty=p= a+ υἱ, 
where 21,41,%2,¥2 are all positive integers. Multiply the first equation by 
y? and the second by y?, and subtract to get 
yzet — yi} = p(y} — γῇ) = 0 (mod p). 

Hence 
(4) Yot, = + yit2 (mod p). 
Now, (3) implies that 71,22,41,y2 are all less than p. Thus (4) shows that 


Yoty = YrX2 OF Yoty + YiX2 = Ὁ. 


If the second equation holds, then 2172 — yiy2 = 0, for multiplication of — 


the equations in (8) gives 
p? = (Οὐ + y?) (a2 + 92) = (αν + Tayi)? + (waive — yaya)’. 
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Thus we have shown that (3), with 21,72,y1,y all positive integers, im- 
plies 


Διὰ = YrY2 OF Yori = Y1%2. 
However, (3) implies that (x1,y1) = 1, (%2,y2) = 1, and hence 
Ly = Yo, To = Yr OF Zi = L2,Yi = Ya 


and our proof is complete. 
Since 2? + 7? = 2 is obviously solvable, all the above results may be 
combined in the “if” part of 


Theorem 6.6b. A positive integer m is the sum of two squares if and 
only if every prime factor of m which is congruent to 3 (mod 4) occurs to 
an even power in m. ἡ 

Proof: Suppose p = 3 (mod 4) and divides m. Then by Lemma 6.6b, 
m = 2 + y? implies ὦ = px, y = pyi, and m = p*m,, where 

mM, = Zit γῇ. 
If m; is divisible by p, it must be divisible by p*, and so we can continue. 
If p occurred to an odd power in m, we would eventually have the impos- 
sible equation m, = x? + y’ to solve where p, but not p*, divides πιο. On 
the other hand, application of Lemma 6.6a shows that all numbers in 
which the primes congruent to 3 (mod 4) occur to an even power are repre- 
sentable as the sum of two squares. 


Exercises 6.6 


1. What circles 2? + y? = m have lattice points on them? 


2. Prove that if the lengths of the sides of a right triangle are integers 
whose g.c.d. is 1, all the prime factors of the length of the hypotenuse 
are congruent to 1 (mod 4). 


3. Find for what integers m, the following equation is solvable in inte- 
gers: 
x? + 2y? = m. 


How many solutions are there when m is a prime? 
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